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Preface 


Mathematics is an essential tool for physicists and engineers which students must 
use from the very beginning of their studies. This combination of textbook and study 
guide aims to develop as rapidly as possible the students’ ability to understand and 
to use those parts of mathematics which they will most frequently encounter. Thus 
functions, vectors, calculus, differential equations and functions of several variables 
are presented in a very accessible way. Further chapters in the book provide the 
basic knowledge on various important topics in applied mathematics. 

Based on their extensive experience as lecturers, each of the authors has acquired 
a close awareness of the needs of first- and second-years students. One of their aims 
has been to help users to tackle successfully the difficulties with mathematics which 
are commonly met. A special feature which extends the supportive value of the 
main textbook is the accompanying “study guide". This study guide aims to satisfy 
two objectives simultaneously: it enables students to make more effective use of the 
main textbook, and it offers advice and training on the improvement of techniques 
on the study of textbooks generally. 

The study guide divides the whole learning task into small units which the stu- 
dent is very likely to master successfully. Thus he or she is asked to read and study 
a limited section of the textbook and to return to the study guide afterwards. Learn- 
ing results are controlled, monitored and deepened by graded questions, exercises, 
repetitions and finally by problems and applications of the content studied. Since the 
degree of difficulties is slowly rising the students gain confidence immediately and 
experience their own progress in mathematical competence thus fostering motiva- 
tion. In case of learning difficulties he or she is given additional explanations and in 
case of individual needs supplementary exercises and applications. So the sequence 
of the studies is individualised according to the individual performance and needs 
and can be regarded as a full tutorial course. 

The work was originally published in Germany under the title “Mathematik für 
Physiker" (Mathematics for physicists). It has proved its worth in years of actual 
use. This new international version has been modified and extended to meet the 
needs of students in physics and engineering. 
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viii Preface 


The CD offers two versions. In a first version the frames of the study guide are 
presented on a PC screen. In this case the user follows the instructions given on the 
screen, at first studying sections of the textbook off the PC. After this autonomous 
study he is to answer questions and to solve problems presented by the PC. A second 
version is given as pdf files for students preferring to work with a print version. 

Both the textbook and the study guide have resulted from teamwork. The au- 
thors of the original textbook and study guides were Prof. Dr. Weltner, Prof. 
Dr. P.-B. Heinrich, Prof. Dr. H. Wiesner, P. Engelhard and Prof. Dr. H. Schmidt. 
The translation and the adaption was undertaken by the undersigned. 


Frankfurt, August 2009 K. Weltner 
J. Grosjean 
P. Schuster 
W. J. Weber 
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Chapter 1 
Vector Algebra I: Scalars and Vectors 


1.1 Scalars and Vectors 


Mathematics is used in physics and engineering to describe natural events in which 
quantities are specified by numerical values and units of measurement. Such a de- 
scription does not always lead to a successful conclusion. 

Consider, for example, the following statement from a weather forecast: 

‘There is a force 4 wind over the North Sea.’ 

In this case we do not know the direction of the wind, which might be important. 
The following forecast is complete: 

‘There is a force 4 westerly wind over the North Sea.’ 

This statement contains two pieces of information about the air movement, 
namely the wind force which would be measured in physics as a wind velocity 


Fig. 1.1 
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in metres per second (m/s) and its direction. If the direction was not known, the 
movement of the air would not be completely specified. Weather charts indicate the 
wind direction by means of arrows, as shown in Fig. 1.1. It is evident that this is 
of considerable importance to navigation. The velocity is thus completely defined 
only when both its direction and magnitude are given. In physics and engineering 
there are many quantities which must be specified by magnitude and direction. Such 
quantities, of which velocity is one, are called vector quantities or, more simply, 
vectors. 

As an example from mathematics, consider the shift in position of a point from 
P, to P5, as shown in Fig. 1.2a. This shift in position has a magnitude as well as 
a direction and it can be represented by an arrow. The magnitude is the length of the 
arrow and its direction is specified by reference to a suitable coordinate system. It 
follows that the shift of the point to a position P3 is also a vector quantity (Fig. 1.2b). 


Р, 


Р, 


(b) 


Fig. 1.2 


A figure in a plane or in space can be shifted parallel to itself; in such shifts the 
direction of all lines of the figure are preserved. Figure 1.3 shows a rectangle shifted 
from position A to position B where each point of the rectangle has been shifted 
by the same amount and in the same direction. Shifts which take place in the same 
direction and are equal in magnitude are considered to be equal shifts. A shift is 
uniquely defined by one representative vector, such as a in Fig. 1.3. Two vectors are 
considered to be equal if they have the same magnitude and direction. 


Fig. 1.3 
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Furthermore, vectors may be shifted parallel to themselves, as shown in Fig. 1.4a, 
if the magnitudes and directions are preserved. 
A vector may also be shifted along its line of action, as shown in Fig. 1.4b. 


ГА 


Vectors can be combined in various ways. Let us consider the addition of vectors. 
Consider the point Рі in Fig. 1.5 shifted to P2, and then shifted again to Рз. Each 


a КЕ 
shift is represented by a vector, i.e. Р1Р2 and P2P3, and the result of the two shifts 


by the vector P;P3. Hence we can interpret the succession of the two shafts as the 
sum of two vectors giving rise to a third vector. 


M 
Р, 
Р, 
Р, 
о x 
Fig. 1.5 


The length of the vector representing a physical quantity must be related to the 
unit of measurement. 


Definition Vectors are quantities defined by magnitude and direction. The 
geometrical representation of a vector is by means of an arrow 
whose length, to some scale, represents the magnitude of the 
physical quantity and whose direction indicates the direction of 
the vector. 
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On the other hand, there are physical quantities, distinct from vectors, which are 
completely defined by their magnitudes. Such quantities are called scalar quantities 
or, more simply, scalars. 


Definition A scalar quantity is one which is completely defined by its mag- 
nitude. 


Calculations with scalar quantities follow the ordinary rules of algebra with pos- 
itive and negative numbers. Calculations with vectors would appear, in the first in- 
stance, to be more difficult. However, the pictorial geometrical representation of 
vector quantities facilitates this task. With vectors it is possible to describe physical 
situations concisely. A clear notation is needed to represent vector quantities and 
there are, in fact, a number of notations in use. 

Vectors are represented by 


1. two capital letters with an arrow above them to indicate the sense of direction, 


e.g. Py P2 where P, is the starting point and P5 the end point of the vector; 
2. bold-face letters, e.g. a, A (the style used in this text); 
— 


W 


letters with an arrow above, e.g. a, A; 
4. underlined letters, e.g. a, and occasionally, by a squiggle underneath the letter, 
e.g. a. 


To distinguish the magnitude of a vector a from its direction we use the mathemati- 
cal notation 
|а| = а 


The quantity [а | is a scalar quantity. 


1.2 Addition of Vectors 


Geometrically, vectors may be combined by defining easy rules. 
It is important that the results (sum, difference) should correspond exactly to the 
way actual physical quantities behave. 


1.2.1 Sum of Two Vectors: Geometrical Addition 


Previously the sum of two vectors was shown to be made up of two shifts, i.e. the 
result of two successive shifts was represented by means of another shift. If two 
vectors а and b are to be added so that their sum is a third vector c then we write 


c=a+b 
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Consider the two vectors a and b, shown in Fig. 1.6a, with a common origin at A. 
We can shift vector b parallel to itself until its starting point coincides with the end 
point of vector a (see Fig. 1.6b). As a result of this shift we define the vector c as 
a vector starting at A and ending at the end of vector b (see Fig. 1.6c). Then с is the 
vector sum of the two vectors а and b and is called the resultant. The triangle law 
of addition of vectors is expressed by the vector equation 


c=a+b 
Áo BEDA 
A a A a A a 
(a) (b) (c) 


Fig. 1.6 


The sum of several vectors is obtained by successive application of the triangle law; 
a polygon is formed as illustrated in Fig. 1.7. 


= Resultant \ 


Fig. 1.7 


The order in which the vectors are added is immaterial. This is known as the 
commutative law, i.e. a + b = b + a. Furthermore, the law for addition of vectors is 
associative, i.e. if a, b and c are three vectors then their sum is 


a+(b+c)=(a+b)+ce (1.1) 


This means that we could add to a the sum of b and c or find the sum of a and b 
and add it to c and still obtain the same resultant. 

Vector addition also follows Newton's parallelogram law of forces which applies 
to two forces acting at a point, as shown in Fig. 1.8. The vector sum of two such 
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vectors а and b is obtained by drawing two lines parallel to the vectors a and b, 
respectively, to form a parallelogram. The vector sum is then represented by the 
diagonal AB; hence c = a +b. A study of the figure shows that it is equivalent 
to the triangle law and that c — AB is obtained by either adding b to a or a to 
b, i.e. 

c=a+b=b+a (1.2) 


Fig. 1.8 


1.3 Subtraction of Vectors 


The method of subtraction for two vectors is obtained by an extension of the rule of 
addition if we first introduce the concept of a negative vector. 


Definition The negative of a vector a is a vector having the same magni- 
tude but opposite direction. We write it as —a. 


—> 
If the vector a starts at A and ends at B, such that a = AB, then it follows that 
—a = BA. 
The sum of a vector and its negative counterpart is zero, for 


a+(—a)=0 


In vector calculus, 0 (as above) is called the null vector. 
If a and b are two vectors, then we call a third vector c the difference vector 
defined by the equation 
c=a-—b 


We can regard this difference as the sum of vector a and the negative of vector b, 
ie.c =a+(—b). 

This result is illustrated in Fig. 1.9 in three steps. 

Firstly, we draw the negative vector —b (Fig. 1.9a); secondly, we shift this neg- 
ative vector so that its end is at the tip of vector a (Fig. 1.9b); and thirdly, we form 
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b 
a a a 
>] —b 
—b, 
(a) 


(b) (c) 
Fig. 1.9 


the sum of a and (—b) in accordance with the triangle law and obtain the difference 
vector c = a + (—b) (Fig. 1.9c). 

To add and subtract vectors we proceed using the rules for addition and subtrac- 
tion. 

The difference vector c — a — b can also be constructed using the parallelogram 
rule. Figure 1.10a shows two vectors a and b; in Fig. 1.106 the parallelogram is com- 


pleted. The difference vector c = a — b is then given by the diagonal BA (Fig. 1.10c). 


(a) (b) (c) 
Fig. 1.10 


Itis easy to see that both constructions lead to the same result, and that the latter 
construction shows clearly that the difference vector can be regarded geometrically 
by the line joining the end points of the two vectors. 


1.4 Components and Projection of a Vector 


Let us consider the shift of a point from position P, to position P5 by the vector a, 
as shown in Fig. 1.11a, and then find out by how much the point has shifted in the 
x-direction. To ascertain this shift we drop perpendiculars on to the x-axis from the 
points Ру and P» respectively, cutting the axis at x; and x2. The distance between 
these two points is the projection of the vector a on to the x-axis. This projection is 
also called the x-component of the vector. 

In the figure, we have shown a rectangular set of axes, i.e. axes which are per- 
pendicular to each other so that point P; has coordinates (x1, y1 ) and point P2 coor- 
dinates (x2, y2). It follows, therefore, that the x-component of the vector a is given 
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(a) (b) 


Fig. 1.11 


by the difference between the x-coordinates of the points P, апар», i.e. by x2 — хі. 
Similarly, the shift of the point in the y-direction is obtained by dropping perpendic- 
ulars from P, and P» on to the y-axis, cutting it at yı and y2, as shown in Fig. 1.11b. 
Hence the y-component of the vector a in the y-direction is given by y2 — y1. 

The components of the vector a are usually written as follows: 


ах = X2 — X1 
ау = Y2— 91 


If the coordinate axes are not perpendicular to each other the coordinate system is 
called oblique. Projections in such a coordinate system are obtained by using lines 
parallel to the axes instead of perpendiculars. We will not use this type of coordinate 
system in this book, even though oblique coordinates are very useful in certain cases, 
e.g. in crystallography. 

Generalisation of the concept of projection. So far we have considered the pro- 
jection of a vector on to a set of rectangular coordinates. We can generalise this 
concept by projecting a vector a on a vector b as follows. 

We drop normals from the starting and end points of the vector a, as shown in 
Fig. 1.12a, on to the line of action of vector b. The line of action of the vector 
is the straight line determined by the direction of the vector. It extends on either 


Fig. 1.12 


1.5 Component Representation in Coordinate Systems 9 


side of the vector, as shown in Fig. 1.12a. The distance between the two normals is 
the component of vector a along vector Р; this is written as ар. We can simplify the 
construction by shifting the vector a parallel to itself until its starting point meets 
the line of action of vector b and then dropping a normal on b from the tip of vector 
a, as shown in Fig. 1.12b. This results in a triangle and the projection or component 
of a in the b direction is 


|45| = |a|cosa 


or ар = acoso 
Similarly, we can project vector b on to a, giving 


ba = bcosa 
1.5 Component Representation in Coordinate Systems 


The graphical addition and subtraction of vectors can easily be carried out in a plane 
surface, e.g. in the x— y plane of a rectangular coordinate system. However, we 
frequently have to cope with spatial problems. These can be solved if the vector 
components in the direction of the coordinate axes are known. We can then treat the 
components in each of the axes as scalars obeying the ordinary rules of algebra. 


1.5.1 Position Vector 


The position vector of a point in space is a vector from the origin of the coordinate 
system to the point. Thus, to each point P in space there corresponds a unique vector. 
Such vectors are not movable and they are often referred to as bound vectors. 

The addition of two position vectors is not possible. But in contrast subtracting 
them gives a sensible meaning. If P and P» are two spatial points, i.e. two positions 
in space, and O is the origin of a coordinate system, then the position vectors are 
OP; and ОР, and their difference, OP; — OP; , is a vector starting at P1 and ending 
at P2, as shown in Fig. 1.13. 


Fig. 1.13 
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1.5.2 Unit Vectors 


Vectors have magnitude and direction, but if we wish to indicate the direction only 
we define a unit vector. A unit vector has a magnitude of | unit; consequently it 
defines the direction only. 

Figure 1.14 shows three such unit vectors (bold arrows) belonging to vectors a,b 
and c respectively. 

Of special significance are unit vectors along a Cartesian or rectangular coordi- 
nate system (see Fig. 1.15). The set of these vectors is also called a base and these 
vectors are called base vectors. In such a three-dimensional system, these unit vec- 
tors are denoted by the letters i , j,k orex,ey,ez Or €1,€2,€3. Here we shall adopt 
the i , j,k notation. 


Fig. 1.14 Fig. 1.15 


Figure 1.16 shows a point P with coordinates P4,Py,P;z. The position vector 
= 
ОР = Р has three components: 
e acomponent Р, i along ће x-axis; 


* acomponent P, j along the y-axis; 
* acomponent P; k along the z-axis. 
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From the figure it then follows that 


2 
OP = P =Р,і +Pyj +Pzk 


1.5.3 Component Representation of a Vector 


A vector can be constructed if its components along the axes of a coordinate system 
are known. Hence the following information is sufficient to fix a vector: 


e a defined coordinate system; 
* the components of the vector in the direction of the coordinate axes. 


Fig. 1.17 


Figure 1.17 shows a rectangular x-y-z coordinate system. If the vector a has 
components 4,4 y,az then 


а = ахі +dyj Һа; 
It can also be expressed іп the abbreviated forms 


a = (dy ,dy,az) 


Thus the vector a is defined by the three numbers ax ,ay,az. To obtain the vector 
а we simply multiply these numbers by the appropriate unit vectors. ax,ay and az 
are the *coordinates' of the vector; they are scalar quantities. 
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Definition We may express a vector 


a = axi +ayj +azk 
ax 
thus: а = (dx,dy,dz) = | dy 
az 


These are called the component representations of the vector a. 


Example The vector shown in Fig. 1.18 is given by a = (1,3,3). 


Fig. 1.18 


Two vectors are equal if and only if their components are equal. Hence if a = b, 
then 


ах = by 
ау = by 
dz = bz 


1.5.4 Representation of the Sum of Two Vectors 
in Terms of Their Components 


We will now show that the result of the geomet- 
rical addition of two vectors can be obtained by 
adding separately the components of the vectors 
in given directions. 

Two vectors a and b in the x-y plane, as shown 
in Fig. 1.19a, can be expressed in terms of unit 
vectors thus: 


a —dxi +ауј 


and 


b—bubyj 
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We now add a and b to give the resultant vector yÅ 
c, as shown in Fig. 1.19b: 


c=a+b 


The x-component of с is cxi = axi + Ьі or 
Cxi = (ax +bx )i . Hence the x-component of the 
resultant vector is equal to the algebraic sum of 
the x-components of the original vectors. 


Similarly, the y-component (see Fig. 1.19c) is 


су) = (ay t by)j 


Fig. 1.19 


It then follows that the vector c, the resultant of vectors a and b, is given by 
c = (ax +bx)i + (ay +by)j 


whose coordinates are (ax + by), (ay 4- by). 
The same procedure can be adopted in the case of three-dimensional vectors. 
If a and b are two such vectors so that 


à = (dx,dy,dz) and b = (bx, by, bz) 
then it follows that 
a+b= (ax+bx, ay-cby, az+bz) (1.3a) 


Generally, the sum of two or more vectors is found by adding separately their com- 
ponents in the directions of the axes. 
1.5.5 Subtraction of Vectors in Terms of their Components 


The task of finding the difference a — b between two vectors a and b can be reduced 
to that of adding vector a and the negative of vector b. 
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It therefore follows that, for the two-dimensional case 
a—b-—(ax—bx, ау —by) 
and for the three-dimensional case 


а= (6g — bg, ay by, 825) (1.36) 


Example Let the vectors be a = (2,5,1) and b = (3,— 7,4) then, in terms of the 
components, we have 


a—b-(2—3, 5-7, 1-4)-1-1,12,-3) 


Of special significance is the difference vector of two position vectors. This is given 
by the vector which joins the end points of the two position vectors. 

Figure 1.20 shows that vector с is obtained by joining the two points Ру and P2 
so that c = Ру — Po. 

In terms of the components of P ; and P5, 


c=(Pix—P2x, Piy— Pay) 


Fig. 1.20 


Example If P, = (3, — 1,0) and P2 = (—2,3,— 1) are two points in space then the 
difference vector с given by с = P5 — P, is 


с-(-2-3, 3-1, -1-0)=(-5,4,-1) 


1.6 Multiplication of a Vector by a Scalar 


Multiplication of a vector by a scalar quantity results in a vector whose magnitude 
is that of the original vector multiplied by the scalar and whose direction is that of 
the original vector or reversed if the scalar is negative. 
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Definition Multiplication of a vector a by a scalar A gives the vector Aa 
having length Aa and the same direction as a when A > 0. If 
А < 0 it has the opposite direction. 


In terms of the components of the vector a, the new vector Аа is given by 
Ха--(Хах, Ady, Аал) (1.4) 
If A = 0, then the vector Aa is the null vector (0, 0, 0). 
Example Given a = (2,5,1), then when A = 3 we have 
Ла = (6, 15,3) 
and when A = —3 we have 


Aa = (—6,—15,—3) 


1.7 Magnitude of a Vector 


If the components of a vector in a rectangular coordinate system are known, the 
magnitude of the vector is obtained with the aid of Pythagoras’ theorem. 

Figure 1.21 shows a vector a with components х, dy, i.e. а = (ax, ay). 

Since the vector and its components form a right-angled triangle, we have 


aa +ау? 


апа the magnitude of the vector is 


la| =a = \/ax? ay? (1.5а) 


Fig. 1.21 
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Fig. 1.22 


The three-dimensional vector a = (ax,ay,az) shown in Fig. 1.22 has a magnitude 


given by 
|а| 2 a = /ax? +ay? +az? (1.5b) 


Example The magnitude of the vector a = (3,—7,4) is 


а= ү32 +72 +42 = 4/74 x 8.60 


The distance between two points in space is thus easily determined if ће compo- 
nents are known. 


Example Figure 1.23 shows two given points in the plane P, = (x1,y1) and 
P2 = (x2, y2). It is required to find the distance between them. 


——— 
To find the distance we require the coordinates of the connecting vector P2P}. 
These are 
—> 
PoP) = (x1 — x2, Y1 — Y2) 


and the magnitude is 


—> 
|PoPi| = yea —X2)? + (yı - y2)? 


Fig. 1.23 
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If P, and P5 are two points in space, then the distance between them is 


— 
|P2P1| = "m = x2)? + (y1 — y2) + (21 – 22)? 


Any vector may be expressed in terms of a unit vector. If a = (ax,ay,az) is any 


vector, its magnitude is 
|а| =a = 4/ax? ay? t az? 


If the unit vector in the direction of a is denoted by eg, then 


ахі +dyj Һа; 


or 
POTE UR (is fy ia) 
la| |а|” 1417 |a| 
Непсе 
а = aeg 
Exercises 


1.1 Scalars and Vectors 


1. Which of the following quantities are vectors? 


(a) acceleration (b) power 

(c) centripetal force (d) velocity 

(e) quantity of heat (f) momentum 

(g) electrical resistance (h) magnetic intensity 


(i) atomic weight 


1.2 Addition of Vectors 1.3 Subtraction of Vectors 


2. Given the vectors a, b and c, draw the vector sum S = a + b + c in each case. 


Fig. 1.25 
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3. Draw the vector sum a, 4- a2 +...+ 4p. 


л 


x 
aj a; 
a3 
ад 
as 
a 
a; s 
S -a 
о | x [ х 


Fig. 1.26 Fig. 1.27 


4. Draw the vector c — a — b. 


Fig. 1.28 Fig. 1.29 


1.4 Components and Projections of a Vector 


5. Project vector a on to vector Ё. 


Fig. 1.30 


Exercises 19 


6. Calculate the magnitude of the projection of a on to b. 


(a) (4|-5, < (a, 9-2 (b (4|-2, (a, 5-7 
(0 (4-4, < (a, b) =0 © 1-3) (а, В) = 2л 


1.6 Component Representation 


7. Given the points P, = (2, 1), P2 = (7, 3) and Рз = (5, —4), calculate the 
coordinates of the fourth corner P4 of the parallelogram P;P2P3P4 formed by 
a шин 
the vectors a = P,P» and b = P|P3. 


Fig. 1.32 


8. If Pi = (x1, y1), P2 = (хо, y2), Рз = (хз, уз) and P4 = (x4, ya) are four 
arbitrary points in the x — y plane and if a = PjP5, b = Р›Рз, с = P3P4, d = 
——— 


P4P,, calculate the components of the resultant vector S =a+b+c+d and 
show that 5 = 0. 
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9. A carriage is pulled by four men. The components of the four forces F1, Ёо, 
Ез, Ед are 


F, = (20N, 25N) F, 
F> = (15N, 5N) 
F3 = (25N, —5N) 
F4 = (30N, —15N) F, 
F; 
F, 
Fig. 1.34 


Calculate the resultant force. 


10. Ifa = (3, 2, 1), b= (1, 1, 1), e = (0, 0, 2), calculate 
(a) а49-с (b) 2a—b+3c 


1.6 Multiplication of a Vector by a Scalar 


11. Calculate the magnitude of vector a = 41a, +A2a2 —A3a3 for the follow- 
ing cases: 


(a) ау = (2; -3, 1), a» = (-1, 4, 2); аз = (6, -1, 1), 
i Беа ДЫ 


12. Calculate in each case the unit vector eg in the direction of a: 
(а) a= (3, —1, 2) (b) а= (2, —1, —2) 


1.7 Magnitude of a Vector 


13. Calculate the distance a between the points P, and P» in each case: 
(a) Py = (3, 2, 0) (b) Pi = (2; -1, 3) 
P2 = (-1, 4, 2) P5 = (4, -2, -1) 


14. An aircraft is flying on a northerly course and its velocity relative to the air is 
Vi = (0 km/h, 300 km/h) 
Calculate the velocity of the aircraft relative to the ground for the following 
three different air velocities: 


(a) V2 = (0, —50) km/h, headwind 
(b) V3 — (50, 0) km/h,  crosswind 
(c) V4 = (0, 50) km/h,  tailwind 
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Fig. 1.35 


Calculate the magnitude of the absolute velocity relative to the ground for the 
three cases: 


(9) |У, + У] (е) |V1 + V3| (f) |V1 + V4| 


Chapter 2 
Vector Algebra II: Scalar and Vector Products 


We saw in the previous chapter how vector quantities may be added and subtracted. 
In this chapter we consider the products of vectors and define rules for them. First 
we will examine two cases frequently encountered in practice. 


1. In applied science we define the work done by a force as the magnitude of the 
force multiplied by the distance it moves along its line of action, or by the com- 
ponent of the magnitude of the force in a given direction multiplied by the dis- 
tance moved in that direction. Work is a scalar quantity and the product obtained 
when force is multiplied by displacement is called the scalar product. 

2. The torque on a body produced by a force F (Fig. 2.1) is defined as the product 
of the force and the length of the lever arm OA, the line of action of the force 
being perpendicular to the lever arm. Such a product is called a vector product 
or cross product and the result is a vector in the direction of the axis of rotation, 
i.e. perpendicular to both the force and the lever arm. 


Fig. 2.1 


2.1 Scalar Product 


Consider a carriage running on rails. It moves in the s-direction (Fig. 2.2) under 
the application of a force F which acts at an angle o to the direction of travel. We 
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require the work done by the force when the carriage moves through a distance s in 
the s-direction (Fig. 2.3). 


Fig. 2.2 Fig. 2.3 


In order to study the action of the force F on the carriage we resolve it into 
two components: one along the rails (in the s-direction), and one perpendicular to 
the rails, i.e. Fs and Fp respectively. F s, F 1 and s are vector quantities; the work 
is, by definition, the product of the force along the direction of motion and the 
distance moved. In this case, it is the product of Р, and s. It follows also from the 
definition that the work done by F is zero since there is no displacement in that 
direction. Furthermore, if the rails are horizontal then the motion of the carriage and 
the work done is not influenced by gravity, since it acts in a direction perpendicular 
to the rails. 

If W is the work done then W = F -cosa-s or F - s -cos œ in magnitude. 

Since work is a scalar quantity the product of the two vectors is called a scalar 
product or dot product, because one way of writing it is with a dot between the two 
vectors: 


W=Fy:s 
where 
|F | = |F |cosa 


It is also referred to as the inner product of two vectors. Generally, if a and b are 
two vectors their inner product is written a · b. 


Definition The inner or scalar product of two vectors is equal to the prod- 
uct of their magnitude and the cosine of the angle between their 


directions: 
a-b=abcosa (2.1) 


Geometrical interpretation. The scalar product of two vectors a and b is equal to 
the product of the magnitude of vector a with the projection of b on a (Fig. 2.4a): 


a-b=abcosa 
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Or it is the product of the magnitude of b with the projection of a on b (Fig. 2.4b): 


a-b=bacosa 


1-4 а —| 
(a) (b) 
Fig. 2.4 


In the case of the carriage, we can also evaluate the work done by the product 
of the magnitude of the force and the component of the displacement along the 
direction of the force (Fig. 2.5). 


Fig. 2.5 


Example A force of 5 N is applied to a body. The body is moved through a distance 
of 10 m in a direction which subtends an angle of 60? with the line of action of the 
force. 

The mechanical work done is 


U=F-s=Fscosa 
—5x 10 х cos 60° = 25Nm 


The unit of work should be noted: it is F 
newtons x metres — Nm or joules (J). This 

example could be considered to repre- 

sent the force of gravity acting on a body 

which slides down a chute through a dis- 

tance s; the force F = mg where m is the Fig, 2.6 
mass of the body and g the acceleration 

due to gravity (Fig. 2.6). 
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2.1.1 Application: Equation of a Line and a Plane 


The scalar product can be used to obtain the equation of a line in an x—y plane 
if the normal from the origin to the line is given (Fig. 2.7). In this case the scalar 
product of n with any position vector r to a point on the line is constant and equal 
to n?. Thus 


n? =n-r —(nx,ny):(x,y) 

n? = Xnx + yny 
nx n? 

у= —x4+— (2.2a) 
ny ny 


If we extend the procedure to three dimensions we obtain the equation of a plane in 
an x— y —z coordinate system: 


n? = xnx-Eyny t znz (2.2b) 


Fig. 2.7 


2.1.2 Special Cases 


Scalar Product of Perpendicular Vectors 
If two vectors а and b are perpendicular to each other so that о = 5 and hence 
cosa = 0, it follows that the scalar product is zero, i.e. а - b = 0. 

The converse of this statement is important. If itis known that the scalar product 
of two vectors a and b vanishes, then it follows that the two vectors are perpendic- 
ular to each other, provided that a 5 0 and 5 0. 
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Scalar Product of Parallel Vectors 


If two vectors a and b are parallel to each other so that o = 0 and hence cos о = 1, 
it follows that their scalar product a · b = ab. 


2.1.3 Commutative and Distributive Laws 


The scalar product obeys the commutative and distributive laws. These are given 
without proof. 


Commutative law a-b=b-a (2.3) 
Distributive law a-(b+c)=a-b+a-c (2.4) 


As an example of the scalar product let us derive the cosine rule. Figure 2.8 
shows three vectors; о is the angle between the vectors a and b. 


We have b+c=a 
с-а-ф 


We now form the scalar product of the vectors with themselves, giving 


c-c = c? = (а – b? 
c? =a-a+b-b—2a-b 
c? = а? +b? —2ab cosa (2.5) 


If o = 5, we have Pythagoras’ theorem for a right-angled triangle. 


2.1.4 Scalar Product in Terms of the Components of the Vectors 


If the components of two vectors are known, their scalar product can be evaluated. 
It is useful to consider the scalar product of the unit vectors i along the x-axis and 
j along the y-axis, as shown in Fig. 2.9. 
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From the definition of the scalar product we deduce the following: 


ii = 1 
jj = 1 
inj = 0 
ji= 0 
Fig. 2.9 


Figure 2.10 shows two vectors a and b that issue from the origin of a Cartesian 
coordinate system. If ay,bx,ay and by are the components of these vectors along 
the x-axis and y-axis, respectively, then 


= ауі+ау) 
b = bitbjj 
Fig. 2.10 


The scalar product is 
a: b = (axi +ay j): (bxi +byj) 
= axbyi d +ахђуі -j +aybxj -i +aybyj -j 
a:b = ахь; +ayby 


Thus the scalar product is obtained by adding the products of the components of the 
vectors along each axis (Fig. 2.11). 
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In the case of three-dimensional vectors it is easily demonstrated that the following 
rule holds true: 


a-b=axby+ayby+azbz (scalar product) (2.6) 


It is also an easy matter to calculate the magnitude of a vector in terms of its 
components. Thus 


a —a-a 


4ахах--4у4у- 44: 


Е ax? a5? az? 


la| = \/ax? +ау2 - az? 


& 
! 


(In Sect. 1.7, (1.55) 
Example Given that a = (2,3,1),b = (—1,0,4), calculate the scalar product. 


a:b = axbx +dyby +azbz 
-2х(-1)-3х0-1х4-2 


The magnitude of each vector is 


а = ү22 +32 +1= V14z 3.74 
b= у14+42 = V17 4.2 
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2.2 Vector Product 


2.2.1 Torque 


At the beginning of this chapter we defined the torque C, resulting from a force F 
applied to a body at a point P (Fig. 2.12), to be the product of that force and the 
position vector r from the axis of rotation O to the point P, the directions of the 
force and the position vector being perpendicular. 
The magnitude of the torque is therefore С = |r||F | or, more simply, С = rF. 
This is known as the lever law. 


Fig. 2.12 


A special case is illustrated in Fig. 2.13 where the line of action of the force 
F is in line with the axis (the angle between force and position vector r is zero). 
In this situation, the force cannot produce a turning effect on the body and conse- 
quently C = 0. 


Fig. 2.13 
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The general case is when the force F and the radius r are inclined to each other 
at an angle o, as shown in Fig. 2.14. To calculate the torque C applied to the body 
we resolve the force into two components: one perpendicular to r, Ё |, and one in 
the direction of r, F Il 

The first componentis the only one that will produce a turning effect on the body. 
Now F у = F sina in magnitude; hence C = rF sino. 


Fig. 2.14 


Definition Magnitude of torque C 
C =rF sing 


2.2.2 Torque as a Vector 


Physically, torque is a vector quantity since its direction is taken onto account. The 
following convention is generally accepted. 

The torque vector C is perpendicular to the plane containing the force F and the 
radius vector r. The direction of C is that of a screw turned in a way that brings r 
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by the shortest route into the direction of F. This is called the right-hand rule. To 
illustrate this statement let us consider the block of wood shown in Fig. 2.15 where 
the axis of rotation is at A and a force F is applied at P at a distance r. The two 
vectors r and F define a plane in space. F is then moved parallel to itself to act at 
A; as the screw is turned it rotates the radius vector r towards F through an angle 
a. Hence the direction of the torque C coincides with the penetration of the screw. 


2.2.3 Definition of the Vector Product 


The vector product of two vectors a and b (Fig. 2.16) is defined as a vector c of 
magnitude ab sino, where о is the angle between the two vectors. It acts in a di- 
rection perpendicular to the plane of the vectors a and b in accordance with the 
right-hand rule. 


SY 


Fig. 2.16 


This product, sometimes referred to as the outer product or cross product, is 
written 
c=axb ог c—a^b (2.7) 


It is pronounced ‘a cross b’ or ‘a wedge b’. Its magnitude is c = ab sina. Note 
that a x b = —b xa. 


This definition is quite independent of any physical interpretation. It has geomet- 
rical significance in that the vector c represents the area of a parallelogram having 
sides a and b, as shown in Fig. 2.17. c is perpendicular to the plane containing a 
and Р, direction given by the right-hand rule. 
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The distributive laws for vector products are given here without proof. 


ax(b+c)=axb+axc (2.8) 
and (a+b)xc=axc+bxe (2.9) 


Further, we note with respect to a scalar A that 


àa x b = a x àb = Аа x b) (2.10) 


Example Given two vectors a and b of magnitudes a = 4 and b = 3 and with an 


angle a = % = 30° between them, determine the magnitude of c = a x b. 


c =absin30° =4х3х 0.5= 6 


2.2.4 Special Cases 


Vector Product of Parallel Vectors 


The angle between two parallel vectors is zero. Hence the vector product is 0 and 
the parallelogram degenerates into a line. In particular 


axa=0 
It is important to note that the converse of this statement is also true. Thus, if the vec- 


tor product of two vectors is zero, we can conclude that they are parallel, provided 
that a Z 0 and b £0. 


Vector Product of Perpendicular Vectors 


The angle between perpendicular vectors is 90°, i.e. sina = 1. Hence 


la x b| = ab 


2.2.5 Anti-Commutative Law for Vector Products 


If a and b are two vectors then 


axb=-bxa (2.11) 
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Proof Figure 2.18 shows the formation of the vector product. The vector product 
is c =a x b and c points upwards. In Fig. 2.19, с is now obtained by turning b 
towards a, then, by our definition, the vector b x a points downwards. It follows 
therefore that a x b — —b x a. The magnitude is the same, i.e. ab sina. 


Fig. 2.18 Fig. 2.19 


2.2.6 Components of the Vector Product 


Let us first consider the vector products of the unit vectors i, j and k (Fig. 2.20). 
According to our definition the following relationships hold: 


1Х1-0 
1Х) =k 
1 k = ~j 
jxj=0 
jXk=i 
j i = —k 
kXk=0 
kXi =j 
k Xj =-i 
Fig. 2.20 


Let us now try to express the vector product in terms of components. The vectors 
а and Б expressed in terms of their components аге 


а = ахі c ayj +azk 
b — byi by j t bzk 
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The vector product is 

ахь = (ахі +ayj +azk) x (bxi + by j + bzk) 
Expanding in accordance with the distributive law gives 


a x b = (axbyi x i) + (axbyi x J) + (axbzi ХК) 
+ (aybx j x i) (aybyj x j) + (aybz j x k) 
+ (azbxk x i) + (azbyk x J) + (azbzk x k) 


Using the relationships for the vector products of unit vectors we obtain 
a x b = (aybz — azby)i + (azbx — axbz)j + (axby — aybx)k (2.12a) 


The vector product may conveniently be written in determinant form. A detailed 
treatment of determinants can be found in Chap. 15. 


ij k 
ax b = |ах ауа; (2.12) 
bx by bz 


Example The velocity of a point P on a rotating body is given by the vector product 
of the angular velocity and the position vector of the point from the axis of rotation. 
In Fig. 2.21, if the z-axis is the axis of rotation, the angular velocity @ is a vector 
along this axis. If the position vector of a point P is r = (0,ry,rz) and the angular 
velocity œ = (0,0, ою z), as shown in the figure, then the velocity v of P is 


ijk 
v=o@xr=|0 0 !:| = –гу!:і 
Ory rz 
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Exercises 


2.1 Scalar Product 
1. Calculate the scalar products of the vectors a and b given below: 
(a) a=3 b=2 а-л/3 (5) a=2 b=5 a=0 
(c)a=1 b=4 o —z/A (4) а= 2.5 b=3 a = 120° 


2. Considering the scalar products, what can you say about the angle between 
the vectors a and b? 


(a) a-b=0 (b) a-b=ab 
() a.b — 5 (d) a-b «0 
3. Calculate the scalar product of the following vectors: 
(a) i —1, 4) (b) ae A —1/3) 
- (21,2, 5) = (1/6, —2, 3) 
(c) ae 1/4, 2, —1) (d) Ет —6, 1 
b = (1, 1/2, 5/3) b —(—1, —1, –1) 
2. Which of the following vectors a and b are perpendicular? 
(a) a — (0, —1, 1) (b) a = (2, —3, 1) 
b — (1, 0, 0) b=(- 1, 4, 2) 
(c) a = (—1, 2, —5) (d) a= (4, —3, 1) 
b = (—8, 1, 2) 8-1-1,-2,-2) 
(e) a=(2, 1,1) (f) a= (4, 2, 2) 
b = (-1, 3, —2) b — (1, —4, 2) 
5. Calculate the angle between the two vectors a and b: 
(а) а= (1, —1, 1) (b) а= (2, 2, –1) 
b= (-1, 1, —1) b = (0, 3, 0) 


6. A force F = (ON, 5N) is applied to a body and moves it through a distance 
s. Calculate the work done by the force. 
(a) sy = (3m, 3m) (b) s2 = (2m, 1 m) (c) s3 = (2m, 0m) 


2.2 Vector Product 
7. Indicate in figures 2.22 and 2.23 the direction of the vector c ifc = a x b 


(a) when a and b lie in the x —y plane 
(b) when a and b lie in the y—z plane 


8. Calculate the magnitude of the vector product of the following vectors: 
(a) a=2 b=3 a=60° (b) a=1/2 b=4 а = 0° 
(c) а= 8 b —3/4 а = 90° 
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x 


Fig. 2.22 Fig. 2.23 


9. In figure 2.24 a = 2i, b = 4j, c = —3k (i, j and К are the unit vectors 
along the x-, y- and z-axes, respectively). Calculate 
(a) axb (D axc (c) exa 
(d) bxc (е) b xb (f) exb 


Fig. 2.24 


10. Calculate c — a x b when 
(a) a — (2, 3, 1) (b) a = (—2, 1, 0) 
b = (–1, 2, 4) b= (1, 4, 3) 


Chapter 3 
Functions 


3.1 The Mathematical Concept of Functions 
and its Meaning in Physics and Engineering 


3.1.1 Introduction 


The velocity of a body falling freely to Earth increases with time, i.e. the velocity of 
fall depends on the time. The pressure of a gas maintained at a constant temperature 
depends on its volume. The periodic time of a simple pendulum depends on its 
length. Such dependencies between observed quantities are frequently encountered 
in physics and engineering and they lead to the formulation of natural laws. 

Two quantities are measured with the help of suitable instruments such as clocks, 
rulers, balances, ammeters, voltmeters etc.; one quantity is varied and the change in 
the second quantity observed. The former is called the independent quantity, or 
argument, and the latter the dependent quantity, all other conditions being carefully 
kept constant. The procedure to determine experimentally the relationships between 
physical quantities is called an empirical method. Such a method can be extended 
to the determination of the relationships between more than two quantities; thus the 
pressure of a gas depends on its volume and its temperature when both volume and 
temperature vary. 

Relationships obtained experimentally may be tabulated or a graph drawn show- 
ing the variation at a glance. Such representations are useful but in practice we prefer 
to express the relationships mathematically. 


A mathematical formulation has many advantages: 


e [tis shorter and often clearer than a description in words. 

* [tis unambiguous. Relationships described in such a way are easy to communi- 
cate and misunderstanding is out of the question. 

* [t enables us to predict the behaviour of physical quantities in regions not yet 
verified experimentally; this is known as extrapolation. 


The mathematical description of the relationship between physical quantities may 
give rise to a mathematical model. 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 


40 3 Functions 


3.1.2 The Concept of a Function 


We now investigate the exact mathematical description of the dependence of two 
quantities. 


Example Consider a spring fixed at one end and stretched at the other end, as 
shown in Fig. 3.1. This results in a force which opposes the stretching or displace- 
ment. Two quantities can be measured: the displacement x in metres (m); the force 
F in newtons (N). 

Measurements are carried out for several values of x. Thus we obtain a series of 
paired values for x and F associated with each other. 


1. The paired values are tabulated as shown in Fig. 3.1. The direction of the force 
is opposite to the direction of the displacement. 
Such a table is called a table of values for all displacements x of the spring for 
which it is not permanently deformed or destroyed. The range of x is called the 
range or domain of definition. The corresponding range of the functional values 
is called the range of values (sometimes referred to as the co-domain). 


Displacement Force 


(m) (N) 

0 0 

0.1 -12 х0 

0.2 —24 Position of rest 

0.3 —3.6 1 

0.4 —4.8 

0.5 —6.0 г F 

0.6 —7.2 —- let 
rt 

Fig. 3.1 


2. We plot each paired value on a graph and draw a curve through the points. This 
enables us to obtain, approximately, intermediate values (Fig. 3.2). 


О оз 04 06 х) 


Fig. 3.2 


3. The relationship between x and F can be expressed by a formula which must 
be valid with the domain of definition. In this case the formula is 


Е--ах, where a=12N/m 
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By substituting values of x we obtain the corresponding values of F. We notice that 
there is only one value of F for each value of x. The formula is unambiguous. 

The letters x and y are frequently used in mathematics to represent paired values, 
80 we can write y = —ax. 

Let us recapitulate. A function may be expressed in different ways: 


by setting up a table of values; 
graphically; 
by means of a formula. 


These three ways of representing a function are of course related. For example, we 
can draw up a table of values from the formula or from a graph. 

If y depends on x then y is said to be a function of x; the relationship is ex- 
pressed as 

у= f(x) 

It reads “у equals f of x". 

In order to define the function completely we must state the set of values of x for 
which it is valid, i.e. the domain of definition. 

The quantity x is called the argument or independent variable and the quantity y 
the dependent variable. 

Once the nature of the function is known, we can obtain the value of y for each 
value of the argument x within the domain of definition. 


Example y — 3x? 
The function in this case is 3x?. For a given value of the argument x, for example 
X — 2, we can calculate y: 
у=3х22= 12 
A function can be quite intricate, for example: 
ax? 


(1— x2)? +bx? 


Mo 


This is an expression found in the study of vibrations. 
For the sake of clarity let us give a formal definition: 


Definition Given two sets of real numbers, a domain (often referred to 
as the x-values) and a co-domain (often referred to as the 
y-values), a real function assigns to each x-value a unique 
y-value. 


In this book we will mainly be concerned with real functions, as opposed to 
more general functions like complex functions. Note that the concept of a function 
implies that the y-value is determined unambiguously. During the previous one or 
two decades the use of the term ‘function’ has changed. In the engineering literature, 
the term ‘two-valued’ or *many-valued function’ is still occasionally used. Strictly 
speaking, in modern terminology what is meant is not a function but a relationship. 
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Example Consider the equation y? = x; it is clear that x has only one value for 
two values of y, e.g. if yy = +2 and y2 = —2 then x = 4 in each case. The equation 
may be rewritten thus: 

y=tvx 


This means that for every (positive) value of x,y has two possible values, --/x 
and —4/x. Hence y is a two-valued ‘function’. Which root we assign to y will, in 
general, depend on the nature of the problem. For instance, the equation 


ax? 


= (1 = x2)? -bx? 


y= 


mentioned previously, is two-valued but the negative root has no physical meaning. 

The ambiguity is removed by, e.g. restricting the value of y to the positive root; 
thus у = +/x is unambiguous. This is a function; its range of values is y > 0. From 
now on, whenever we use the symbol ,/_ for the square root, the positive root is to 
be understood. 


3.2 Graphical Representation of Functions 


3.2.1 Coordinate System, Position Vector 


Many functions can easily be represented graphically. Graphs are usually based 
on a rectangular coordinate system known as a Cartesian system (after the French 
mathematician Descartes). The vertical axis is usually referred to as the y-axis, and 
the horizontal axis as the x-axis (Fig. 3.3). In certain applications they may bear 
different labels such as 1,0, etc. The axes intersect at the point O, called the origin 
of the coordinate system. 

Associated with each axis is a scale and the choice of this scale depends on the 
range of values of the variables. The coordinate system divides a plane into four 
regions, known as quadrants, numbered counterclockwise. A point Ру (Fig. 3.4) 


Fig. 3.4 
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is uniquely defined in the coordinate system by two numerical values. If we drop 
a perpendicular from P4, it meets the x-axis at Py. Py is called the projection of 
P, on to the x-axis and is related to a number x, on the x-axis, the x-coordinate or 
abscissa. 

In a similar way, Py is the projection of P, on the y-axis, and we find a number 
Ут, the y-coordinate or ordinate. Thus, if we know both coordinates for the point 
P1, then it is uniquely defined. 

This is often written in the following way: 


Py = (x1,y1) 


The coordinates represent an ordered pair of numbers: x first and y second. The 
point P, in Fig. 3.4 is defined by xı = 2 and yı = 3, or P, = (2,3). 

As an aside, note that the distance measured from the origin O of the coordinate 
system and the point P, as a directed distance is called the position vector and its 
projections on the axes are referred to as its components. These components are 
directed line segments. Vectors will be introduced in detail in Chap. 1. 


3.2.2 The Linear Function: The Straight Line 


A straight line is defined by the equation 


y=ax+b 


We can obtain a picture of the line very quickly by giving x two particular values 
(Fig. 3.5): 
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The constant a is the slope of the line. If two points of a straight line are known 
its slope can be calculated: 
PT y2— y1 6.1) 
X2—X1 


Proof Consider two arbitrary points on the line, Py = (x1, y1) and P5 = (x2, yz). 
Substitution in the equation for the straight line gives 


yı = ахі +b 


у =ax2 +b 


If these are now substituted in the right-hand side of the equation to find the slope, 


we have 
(ахо +0) – (axı +b) aļ(x2—xı) 


X2 — ХІ X2—X1 


The constant b is the intercept of the line on the y-axis (Fig. 3.6), i.e. the point of 
intersection of the line with the y-axis has the value b. 


3.2.3 Graph Plotting 


Consider the function i 
= 1 
У х-1 = 


and suppose we wish to plot its graph. There are three basic steps to follow: 


1. Set up a table of values. The best way to do this is to split up the function by 
taking a convenient number of simple terms as illustrated in the table 3.1. 


Table 3.1 
1 1 
2 FT x+1 > 
—4 -3 -0.33 0.67 
-3 -2 -0.50 0.50 
-2 -1 -1.00 0 
-1 0 оо оо 
0 1 1.00 2.00 
1 2 0.50 1.50 
2 3 0.33 1.33 
3 4 0.25 1.25 
4 5 0.20 1.20 
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2. From the table of values, place each point whose coordinates are (x, y) onto the 
coordinate system (Fig. 3.7). 
3. Draw a smooth curve through the points as shown in Fig. 3.8. 


Fig. 3.7 Fig. 3.8 


We observe that for this function there is a difficulty at x = —1. As x tends 
to the value —1 the function grows beyond limit: it tends to infinity. In order 
to obtain a better picture of the behaviour of the function it is advisable to take 
smaller steps and hence calculate additional values in the neighbourhood of x = — І, 
e.g. —1.01, —1.001, —0.95, —0.99, etc. This means that we increase the density of 
the points to be taken close to x = — 1, whereas for other values of x where the graph 
changes less dramatically we can increase the distance of x between the points. 

Physicists and engineers often require a picture of the way a function behaves 
rather than to know its exact behaviour. The process of obtaining such a picture is 
called curve sketching. For this purpose it is important to be able to identify the 
salient features of a function and these we will now investigate. 


Poles 


These are the points where the function grows beyond limit, i.e. tends to infinity 
(+ or —ce). In the above discussion, such a point was found at x = — 1. Poles are 
also referred to as singularities. The corresponding x-values are excluded from the 
domain of definition. 

To determine where the poles occur we have to find the values of x for which the 
function y — f (x) approaches infinity. In the case of fractions, this occurs when the 
denominator tends to zero, provided that the numerator is not zero. In our example 
we need to consider the fraction Tu 

We see that the denominator vanishes when x — — 1; thus our function has a pole 
at the point хр = — 1. 

Poles can also be found by taking the reciprocal of the function so that for y to 
tend to infinity the reciprocal must tend to zero. 
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Asymptotes 


When a curve has a branch which extends to infinity, approaching a straight line, 
this line is called an asymptote. In our example such an asymptote is the line y = 1 
which is parallel to the x-axis. 


Zeros of a Function 


These occur where the curve crosses the x-axis. To find their positions we simply 
have to equate the function to zero, i.e. y = 0, and solve for x. 


In our example we have 


1 
——+1=0 
1+x 


Solving for x gives x = —2. 


Maxima and Minima and Points of Inflexion 


These are other characteristic points of a function which are discussed in Chap. 5, 
Sect. 5.7. 
As a further example, consider the function 


у-х2-2х-3 


It is a parabola. It does not have poles ог asymptotes. The zeros are found by equat- 
ing it to zero and solving for x, which is shown in the following section: 


х2-2х-3-0 


The zeros are x; = 3 and x? = —1. 
The graph of the function is shown in Fig. 3.9. 


Fig. 3.9 
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3.3 Quadratic Equations 


Any equation in which the square, but no higher power of the unknown, occurs is 
called a quadratic equation. The simplest type, the pure quadratic, is x? — 81, for 
example. To solve for x we take the square root of both sides. Then x — 4-9 or 
x = —9 since (4-9)? = 81 and (—9)? = 81; hence х = +9. It is essential in practice 
to state both values or solutions, although in some situations only one value will 
have a physical significance. 

The general expression for a quadratic takes the form 


ах? bx4 c0 


Because of the squared term this equation has two solutions or roots. It can be solved 
by 'completing the square'. We proceed as follows. 

The terms containing the unknown are grouped on one side of the equation and 
the constants on the other side. The left-hand side is made into a perfect square by 
a suitable addition, the same amount being added to the right-hand side. Then the 
square root of both sides is taken. 

Hence the roots are found as follows: 


ax*+bx+c=0 


We subtract c on both sides, giving 


ax? + bx = —c 
We divide throughout by a: 
2 b Ç 
x^4-lx-2-- 
a a 


To make the left-hand side into a perfect square we must add 5? /4a? to both sides: 


p? Е p? c 02 4ас 
442 4a? a 4a? 


2 b 
X +-х+ 
a 


Hence 
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Taking the square root of both sides gives 


b | b? — Дас 1 L———— 
PE Е LE 52-4 
2а 4а2 2а У А 


Hence х = (—5)/(2a) + 1/(2a) Vb? — 4ас 


ы 
+ 
II 


The roots of the quadratic equation ax? +bx+c=0 are 


—b + Vb? – 4ас —b — Vb? — 4ас 
x = T ad = 0 y (3.22) 


Quadratic equations occur frequently in physics and engineering; the formulae 
for the two roots should be remembered. Often the quadratic equation is written in 
the form 

x? + px+q=0 


The roots of the quadratic equation x? + px +q = 0 are 


Spey a 4g е аде ане 
Ss aae M (32b) 


X1 2 


As a check, it is easy to verify that with these values of x1, x2 
х2 +рх+9 = (x — x1)(x — x2) 


i.e. the quadratic is expressed as the product of two linear expressions. 

It should be noted that there are cases when no real solutions exist. The general 
quadratic equation ax? + bx +c = 0 has real solutions if the expression b? — 4ac, 
called the discriminant, is positive, i.e. the square root can be extracted. It is not 
hard to see that this corresponds to the function f (х) = ax? -- bx +c having zeros. 

Conversely, if the discriminant b? — 4ac is negative, then the function does not 
have a zero, i.e. its graph does not cut the x-axis. 

We therefore have a criterion which allows us to decide whether a parabola, given 
by an algebraic expression, lies entirely above the x-axis or below the x-axis; the 
condition is b? — 4ac < 0. 
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3.4 Parametric Changes of Functions and Their Graphs 


Very often functions contain constants which may be chosen deliberately; these are 
called parameters. By considering the corresponding graphs we will now study the 
changes in the shape of the graphs associated to common variations of parameters. 
For the following examples we will use the standard parabola, but the effects are 
not limited to parabolas, of course. For instance, we will apply the rules instantly to 
trigonometric functions. 


Multiplication of the function by a positive constant C 


Effect: The graph will appear elongated along the y-axis if C > 1. It is compressed 
if C « 1. 


x 


C=2 y=C:x? C=0.5 
elongated compressed 


Fig. 3.10 


Adding a constant C to the function 


Effect: The graph will be shifted along the y-axis by the amount C. 


y y: y 
2 3 2 
1 1 1 
X 1 X X 
у = х? С= 2 y=x +C С= 0.5 
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Multiplication of the argument by a positive constant C 


Effect: The graph will appear compressed along the x-axis if C > 1. It is elongated 
ifC « 1. 


у= х? 


Fig. 3.12 


Adding a constant С to the argument 


Effect: The graph will be shifted along the y-axis by the amount C: The direction is 
to the left if C is positive and to the right if C is negative. 


У, У У, 


1 A 2 1 = 1 * 
у= х? C22 y=(+C? С=0.5 


3.5 Inverse Functions 


Given a function y = f (x), its inverse is obtained by interchanging the roles of x 
and y and then solving for y. The inverse function is denoted by y = f~!(x). For 
example, if y = ax +b where a and b are constants, then the inverse function is 
у= x/a —b/a. 

Geometrically, the formation of the inverse can be understood in two ways which 
are equivalent: 


1. Interchanging the roles of x and y is equivalent to interchanging the roles of the 
coordinate axes. In this case the graph remains unchanged but now we have a 
y—x coordinate system. 
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2. If we keep the x— y coordinate system, as is generally done, the graph of the 
inverse function is obtained by reflecting the graph of f(x) in the line y = x. 
This is shown in Fig. 3.14. AA’ is the bisecting line. 


Fig. 3.14 


The figure shows the function y = 2x? and its inverse y = +,/x/2, showing, 
in this case, that the inverse 'function' has two values for every x, i.e. it is not 
a monotonic function. The problem is solved by restricting the domain of the origi- 
nal function y = 2x? to x > 0. Then the inverse function is у = \/x/2. 

Multi-valued inverse functions cannot occur when y = f(x) is a continuous 
and monotonic function. Such a function is defined as follows. A continuous func- 
tion y = f(x) is monotonic if, in the interval хү < x < x», it takes all the val- 
ues between f(x1) and f(x2) only once. Such a function has a unique inverse, 
y = f(x), which is itself a continuous monotonic function in the corresponding 
interval. Thus the inverse is also single-valued. (The concept of continuity is treated 
in detail in Chap. 5, Sect. 5.2. For the time being it will suffice to say that all func- 
tions commonly encountered, such as power functions, fractional rational functions 
and trigonometric functions, are continuous throughout their domain of definition.) 
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3.6 Trigonometric or Circular Functions 


3.6.1 Unit Circle 


A circle having a radius equal to unity is called a unit circle and is used as a reference 
(Fig. 3.15). 


Fig. 3.15 


In geometry, angles are measured in degrees. A right angle has 90°, whilst the 
angle around the four quadrants of a circle is 360° or the total angle at the center is 
360°, this being subtended by the circumference. 

In physics and engineering, angles are usually measured in radians (abbreviated 
to rad). In radians an angle of 360° has the value of the circumference of the unit 
circle, namely 27r. 

It follows that since 


3607--2л rad 
then pae L 257.35 
2л 
gas nbus 
and 1 =з = 0.01745 rad 


To convert an angle from degrees to radians, we have 


о 


Prad = ? d 


—— га 
57.3 
and to convert an angle from radians to degrees we have 


ф = фаа х 57.3 


It is customary for angles to be considered positive when measured anticlockwise 
from the x-axis (Fig. 3.15) and negative when measured clockwise. 
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3.6.2 Sine Function 


The sine function is frequently encountered in physical problems, e.g. in the study 
of vibrations. The sine of an angle is defined by means of a right-angled triangle, as 
shown in Fig. 3.16a; the sine of an angle is the quotient of the side opposite and the 
hypotenuse. Its magnitude is independent of the size of the triangle. 


sing = (3.3) 


Hypotenuse = 1 


a(opposite) = sing 


b(adjacent) 


(a) 
Fig. 3.16 


Considering now the unit circle shown in Fig. 3.16b, let P be a point on the 
circumference; the position vector of the point makes an angle @ with the x-axis. It 
follows that the y-coordinate of the point P is equal to the sine of the angle for, by 
definition, 


sing = 2 
r 


but since r = | we have y = sing. 
This is true for all points on the circumference and therefore for all angles. 


Definition The sine of an angle ¢ is the y-coordinate of the point P on the 
unit circle corresponding to ф. 


A graphical representation of the sine function is obtained by plotting $ as the 
independent variable (the argument) and sin@ (the dependent variable) as ordinate, 
as shown in Fig. 3.17 for $ between 0 and 27 radians. This corresponds to one 
complete revolution of the point P on the unit circle. 
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If P is allowed to move several times around the unit circle, then ф grows beyond 
2л and takes on large values, as shown in Fig. 3.18. For each revolution of P the 
values of the sine function are repeated periodically. 


Definition А function y = f (x) is called periodic if for all x within the 
range of definition we have 


f(x p) = f(x) 


where p is the smallest value for which this equation is valid. p 
is called the period. 


The sine function has a period equal to 27. 
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If P is allowed to revolve clockwise around the unit circle then $ is negative, 
by our definition, and this is equivalent to the sine function being continued to the 
left, as shown in Fig. 3.19. For negative values of $, for example ф = —1, the sine 
function has the same value as for ф = 1 except for the change in sign, i.e. 


sin(—$) = —sing 


Fig. 3.19 


If a function f (x) is such that f(—x) = — f (x), then it is called an odd function. 
The sine function is odd. 

If a function f(x) is such that f(—x) = f(x) it is called an even function. An 
example will be discussed in Sect. 3.6.3 cosine function. 

If the sine function is plotted in a Cartesian x— y coordinate system, then x rep- 
resents the angle in the unit circle and y the sine of the angle (Fig. 3.20). 


Fig. 3.20 


Values of the sine functions can be obtained using a calculator or from tables, but 
the former is more convenient. They can also be calculated using power series, as 
will be shown in Chap. 8. 

In the following we apply the rules obtained in this Section. 


Amplitude 
The function y = sin х has an amplitude of 1 unit; the range of values of the function 
18-15 y € 1. If we multiply the sine function by a factor A we obtain a function 


having the same period but of amplitude A. 


Definition Тһе amplitude is the factor A of the function 


y — Asinx 
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Figure 3.21 shows sine functions with A = 2, 1 and 0.5 respectively. 


yı =2sinx | (dashed curve) 


y2 —sinx (full curve) 


ys =0.7sinx  (dot-dash curve) 


Fig. 3.21 


Period 


Multiplying the argument of a sine function by a constant factor changes the period 
of the function. For example, 
y =sin2x 


where the argument is 2x. Plotting this function as shown in Fig. 3.22 reveals that 
the period is zr, i.e. the function oscillates at twice the frequency as does the function 
y —Ssinx. 


In general, the period p of the sine function 
y =sinbx 


is given by 
poe (3.4) 
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Figure 3.23 shows the graphs of the function y = sinbx for a large and a small 
value of b. 


» 


14 


b(small) 


b(large) 


Fig. 3.23 


In physics and engineering, we frequently encounter the following notation: 
y —sinot 


The constant 5 has been replaced by the symbol w which stands for circular fre- 
quency (in radians per second) and f which stands for time (in seconds). The circular 
frequency o is the number of oscillations in a time of 27s. 
The frequency f in cycles per second or hertz is the number of oscillations in 
a time of 1s. Hence the circular frequency w and the frequency f are related as 
follows: 
о = 2лў 


Phase 
Consider the function 
y =sin(x +c) 


The effect of adding a constant to the argument x is shown in Fig. 3.24 for a partic- 
ular case where с = л /2 radians. 


Fig. 3.24 


The graph shows that the sine curve is shifted by an amount л /2 to the left. This 
constant is called the phase. 
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Definition The phase is the constant term added to the argument of 
a trigonometric function. 


For a positive phase the curve is shifted to the left and for a negative phase it is 
shifted to the right. It is usual in physics and engineering to use a Greek letter such 
as фо to denote a phase. 


The function y = Asin(ot + фо) is a sine function of amplitude A, of the circular 
frequency o and phase angle фо. 

The function y = Asin(ot + фо) is said to lead the function y = B ѕіп оѓ by фо. 
The function y = Asin(wt — фо) is said to lag the function y = B sinwt by фо. 


3.6.3 Cosine Function 


The cosine of an angle is defined as the ratio of the adjacent side to the hypotenuse 
in a right-angled triangle (Fig. 3.25). 


coso = ? (3.5) 


Consider a point P on the unit circle shown in Fig. 3.26. The cosine of the angle ф 
is equal to the length of the abscissa, the x component, which is the projection of P 
on the horizontal axis. 


M 
H 
4! | 
с (hypotenuse), Al : 
а (opposite) 212 X 


b (adjacent) 


Fig. 3.25 Fig. 3.26 


If, as shown in Fig. 3.27, x is the angle turned through by the radius of the unit 
circle and y the projection of the point P on the horizontal axis, then 


y —cosx 


Definition The cosine of an angle $ is the x-coordinate of the point P on 
the unit circle corresponding to $. 
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Fig. 3.27 


Figure 3.27 illustrates the graph of this function for positive and negative values 
of x. We observe that the cosine function is an even function. The cosine function 
can be obtained from the sine function by shifting the latter to the left by л /2 radi- 
ans; hence, by inspection of the graphs, 


cos X = sin (x + 2) (3.6) 


It follows that the sine function can be obtained from the cosine function by shifting 
the latter by an amount л /2 radians to the right; hence 


1 л 
sin x = cos Жон 


Whether one uses the sine or cosine function depends on the particular situation. 


Amplitude, Period and Phase 


The general expression for the cosine function is 


y = Acos(bx +c) 


where A = amplitude 
2л 
=" = period 
b perio 


c — phase; the curve is shifted to the left if c is positive 
and to the right if c is negative . 


3.6.4 Relationships Between the Sine and Cosine Functions 


1. In Fig. 3.28, the position vector to the point P makes an angle $ with the x axis, 
while the point P, is obtained by subtracting a right angle from ф. We have 
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Fig. 3.28 


From the figure it is clear that 


sing = cos$, = cos С - 2) 


Similarly, as already mentioned in the previous section, 


cos¢ = sin С + 2) 
2. Applying Pythagoras’ theorem to the right-angled triangle in Fig. 3.29 gives 


sin? ф + соѕ2ф = 1 (3.7) 


yj 

1 
/ yh 
12 x 


Fig. 3.29 


From this identity there follow two relationships which are frequently used: 
sing = V 1-— cos? $ 
cos = 4/ 1 — sin? ф 


They are valid for values of ф between 0 and л /2. For larger angles the sign of the 
root has to be chosen appropriately. 
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3.6.5 Tangent and Cotangent 


The tangent of the angle ф in Fig. 3.30 is defined as the ratio of the opposite side to 
the adjacent side. 


2 a(opposite) 
tang = 5 (3.8a) 
PA g 
b(adjacent) 
Fig. 3.30 
From the definition of the sine and cosine functions it follows that 
uj E (3.8b) 
coso 


As for the sine and cosine, the graph of the tangent function can be derived from the 
unit circle. In Fig. 3.31 we erect a tangent at A to the unit circle until it meets the 
radial line OP extended to Р”, The point Р” has the value tan $. 


Fig. 3.31 


As ф approaches x /2 the value of tang grows indefinitely. The function у = 
tan ф is shown in Fig. 3.31. The period of tan 9 is л. 
The cotangent is defined as the reciprocal of the tangent so that 


rr m: (3.9) 


— tang  sinó 
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3.6.6 Addition Formulae 
A trigonometric function of a sum or difference of two angles can be expressed 


in terms of the trigonometric values of the summands. These identities are called 
addition formulae. 


sin($4 + $5) = sing; cos¢2 + cos, sin $2 
соѕ(фі + $2) = соѕфі cos h2 — sing; sin $2 (3.10) 


If фэ is negative, then, noting that 


sin(—$5) = — sin ġ2 
cos(—2) = cos» 


we immediately obtain the addition formulae for the difference of two angles: 


sin($1 — $2) = sing; cos p2 — cosQ sin фо 
cos($1 — $2) = соѕфі cosa + sing) sin фо (3.11) 


The proof for 81401 + фә) is as follows. From Fig. 3.32 we have 


$ —$61t02 


We drop a perpendicular from P2 on to the position vector Pı obtaining а right- 
angled triangle having sides 


а = со8ф2 
b = sing2 
The sine of the angle ф = фу + фо is given by the line segment P5 О made up of two 


segments c and d; thus 


sin($1 + ф2) =c+d 
= asing; + bcos, 
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Substituting for a and b gives 


sin($1 + $2) = sing) cosa + cos фі sin $2 


The proof for cos(¢1 + $2) can be developed geometrically in a similar fashion; 
alternatively it can now be given algebraically, since соѕф = sin(¢ + 2/2). 


Sum of a Sine and a Cosine Function with Equal Periods 


The sine and cosine functions have the same period but the amplitudes may be 
different. Their sum should result in a single trigonometric function of the same 
period but with different amplitude and with a phase shift. 


Superposition formula . . 
Asing + B cosó = C ѕіп(ф + фо) 


where С = ү 42 + B? 


В 
апа tan фо = 4 (3.12) 


This relationship is important in the study of waves and vibrations. 
Figure 3.33 illustrates the superposition of the two functions 


yi1—12sinó and уә = 1.6соѕф 


with the resultant уз = 2sin(¢ + 53°). 


Fig. 3.33 
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The proof of the superposition formula is as follows. From Fig. 3.34 we have 
a=Asing 
b= Всоѕф 

and a+b=Csin(¢+¢9) 


It follows that 


a+b = Asing + В coso 
= C sin(¢+ фо) 


Furthermore, we see that 


C—wVA24 B? Fig. 3.34 


B 
tango = я 


Further important relationships will be found in the appendix at the end of this 
chapter. All formulae follow from the addition formulae given and the known rela- 
tionships between trigonometric functions. 


3.7 Inverse Trigonometric Functions 


Trigonometric functions are periodic and therefore cannot be monotonic; conse- 
quently their inverses cannot be formed unless the domain of definition is restricted. 
The restricted domains of definition are chosen as follows: 


T<x<Ī fo y-si 
p or y=sinx 
О<х<л for y=cosx 
л л 
E^ for y=tanx 
О<х<л for y=cotx 


Inverse sine function: 
y = sin! x defined for |x| < 1 and 
y&En2 


Fig. 3.35 


3.7 Inverse Trigonometric Functions 65 


Inverse cosine function: 
y = соѕ! x defined for |х| < 1 and 
О<у<л 


Inverse tangent function: 
y = tan! x defined for all real 
values of x and |y| € л/2 


Inverse cotangent function: 
y = cot ! x defined for all real 
values of x and 0 € y € x 


m dem restricted function 
inverse function 
ee original function 


Fig. 3.35 (continued) 
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The symbol sin~! x is used to denote the smallest angle, whether positive or nega- 
tive, that has x for its sine. The symbol does not mean 1/sinx; it is understood as 
‘the angle whose sine is x’. It can be written as arcsin x. The other inverse trigono- 
metric functions may be written in a similar way, using either symbol, e.g. tan! x 
or arctan x. 

We know from our discussion of inverse functions in Sect. 3.5 that the inverse 
function is the mirror image of the original function in the bisector of the first quad- 
rant in a Cartesian coordinate system. The graphs of the inverse trigonometric func- 
tions are shown in Fig. 3.35. 


3.8 Function of a Function (Composition) 


We frequently encounter functions where the independent variable is itself a func- 
tion of another independent variable. For example, the kinetic energy T of a body 
is a function of its velocity v: 


T= f(v) 
But in many cases the velocity is itself a function of the time f, so that 
v=g(t). 


It is therefore evident that the kinetic energy can also be considered a function of 
time t; hence we have 
T= f(g(t)). 


Definition А function of a function is expressed in the following form: 


у= flg(x)] 


f is called the outer function, g is called the inner function. 
The new function is also referred to as the composition of the 
functions f and g. 


Example 


We require y = f(x) 
The solution is у = (x + 1)? 


This is a rather simple example. To demonstrate that the use of the concept of com- 
position may simplify calculations, consider the following function: 


y =sin(bx +c) 
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To calculate the value of y we first evaluate the inner function g and then apply the 
outer function. Hence we compute g = bx + с independently and then obtain the 
sine of g. 

As a very special, but nevertheless important example, consider the case where 
f and g are inverse functions, i.e. g(x) = f~!(x). Then the following identities 
hold true: 


Loosely speaking, the functions f(x) and /7 (х) have opposite effects. For 
example, let 


Then rae oy 
and XC 


Appendix 


Relationships Between Trigonometric Functions 


sin(—$) = —sing sin? ф +cos* ¢ = 1 
cos(—9) = cos ó tang = = 
: л\ coso 
sin ($+ 2) = cos со!ф und 
cos ( + 5) = —sin$ó 
Addition Theorems 
sin($1 + $5) = sing; соѕфо + cos, sin фо 
sin($1 — $5) = sing; cos 2 — cos¢ sin фә 
cos($1 + $2) = соѕфі cos фә — sing) sin фо 
cos($1 — $2) = соѕфі cos do + sing) sin фә 


510 2ф = 2510 фсоѕф 


ane = Tu —cos¢) 


2 2 
sing; +510 фә = 2 (sin $i ын cos 91 52) 
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Table of Particular Values of Trigonometric Functions 


Radians 0 х шд ыг m 
6 4 3 2 
Degrees 0° 30° 45° 60° 90° 
1 1 
ѕіпф 0 5 =у2 0.707 5У3% 0.866 1 
ф 1 1 3 x 0.866 1 2 ~ 0.707 : 0 
cos 2 520. 2 LU. 2 
1 
tang 0 3У3 1 V3 = 1.732 oo 


Chapter 4 


Exponential, Logarithmic 
and Hyperbolic Functions 


4.1 Powers, Exponential Function 


4.1.1 Powers 


Consider the multiplication of a number by itself, for example a x a x a. A simple 
way of expressing this is to write a>. If we multiplied a by itself n times we would 
write a”. We would say a to the power n;n is known as an index or exponent. 


Definition The power a" is the product of n equal factors a. 
à is called the base and n the index or exponent. 


This defines the powers for positive integral exponents only. What about negative 
exponents? 

If a" is reduced by the factor a, this is equivalent to dividing it by a, i.e. a" /a. 
The number of factors is now n — 1; hence we would write 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
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By this process we have, in fact, given a meaning to a negative index. Hence 


3 
a 
n>0O a? = — — aa 
a 
2 
а 
аї = — =a 
а 
1 
а 
п= 0 a =_= 1 
а 
1 
п<0 ат! = - 
а 
1 
22 
саныг 
Definition i 
ge (4.1) 


a9 = 1 is valid for any base a except for a = 0 because 0° is not defined. 


4.1.2 Laws of Indices or Exponents 


Product а" ха" = qim (4.2) 
8 a" п-т 

Quotient gn a (4.3) 

Power (a? y? =а"" (4.4) 

Root ат = Va 


gm = ah (5) = Va" (4.5) 
т 


This is only defined for a > 0. 


These rules are valid for integral indices and also for arbitrary indices; the latter will 
be examined in Chap. 8. 
There are three values of the base a which are in common use: 


1. Base 10: Order of magnitude of quantities can be easily expressed in terms of 
powers of 10. 


Examples: Distance of the Earth from the Moon is given as 3.8 x 108 m, av- 
erage height of an adult as 1.8 x 100 m, radius of a hydrogen atom 
as 0.5 x 10:10 m. 
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2. Base 2: This is used in data processing and information theory (computers). 
3. Base e: e is a special number, known as Euler’s number; its numerical value is 


e = 2.71828... 


The importance of this number and powers of it will become clear in Chaps. 5, 6 
and 10 on differential and integral calculus, and on differential equations. It is of 
paramount importance in higher mathematics. 


4.1.3 Binomial Theorem 


The following identities are well known: 


(a+b)? = a? +2ab+ b? 
(a — b? =a? —2ab+b? 


The general formula for positive integral powers of a sum, (a +b)”, is known as the 
binomial theorem; it states 


-1 = D)n—-2 
(4-5) = a^ хат) J gap? Po DO 29 Do- P 4... 


n(n—1)(n—2)...x2x1,, 
1 x 2x ...(n—2)(n — 1)n 


(4.6) 


The coefficients are known as binomial coefficients. The last coefficient is included 
for the sake of completeness; its value is 1. 


4.1.4 Exponential Function 


The function a* is called an exponential function; x, the index or exponent, is the 
independent variable. 


Example Let y — 2* 


x 2% 
By giving x positive and negative -3 0.25 
integral values, the table on the -2 025 
chase eats nroduced -1 005 
right is easily produced. 0n 5 
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Figure 4.1 shows the graphs of the functions y = 2*, y = e*, y = 10%. 


All exponential functions go through the point corresponding to x = 0, since all 
are equal when y = 1. Exponential functions grow very rapidly, as can be seen from 
the figure, even for small x-values; hence they are not easily represented graphi- 
cally for large x-values. Exponential functions grow much faster than most other 
functions (unless a < 1). 

Rates of growth in nature are described by exponential functions. An example is 
the increase in the number of bacteria in plants. Suppose that cell division doubles 
the number of bacteria every 10 hours. Let A be the number of bacteria at the start 
of an experiment; the table below gives the growth of the bacteria and this growth is 
represented graphically in Fig. 4.2. 


Time Number of Number 

(h) bacteria 30 

0 A 

10 24 20 

0 4 

30 8A 2: 

40 16A 

0 32А O' 10 20 30 40 50 Time (h) 
Fig. 4.2 


The relationship between the growth of the bacteria and the time is expressed 
by means of the exponential function y = A x 29^, The coefficient 0.1 is used 
because after exactly 10 time units (hours in this instance) the number of bacteria 
has doubled. In general this coefficient is the reciprocal of the time 7 required to 
double the bacterial population. Hence we write 


у-Ах201Т, 


Exponential functions, in particular decreasing exponential functions (decay and 
damping), are frequently encountered in physics and engineering. 
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Example Radium is an element which decays without any external influence owing 
to the emission of a and y radiation. Measurements show that the quantity of radium 
decays to half its original value in 1580 years. Unlike the bacteria example, the 
quantity of radium decreases with time. In this case we can write 


y-Ax2** 
The time required for the quantity of radium to decay to half its original value is 


referred to as the half-life tp. 
Hence the law for radioactive decay is given by 


y=Ax2 tlt 


Figure 4.3 shows the decreasing exponential function for the decay of radium. 


Number of 
Ra atoms 
A 
O 1000 2000 3000 
Time (years) 
Fig. 4.3 


The decreasing exponential function describes damped vibrations, the discharge 
of capacitors, Newton’s law of cooling and many other cases. 
Finally, we will mention in passing another exponential function, namely 


x2 


yes 
for positive and negative values of x. The graph of the function is bell-shaped, as 


shown in Fig. 4.4. This function can play an important role, e.g. in statistics (the 
Gaussian or normal distribution). 


M 
1 


Fig. 4.4 
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4.2 Logarithm, Logarithmic Function 


4.2.1 Logarithm 


In Sect. 4.1.4 we considered the exponential function 
y-a* 


We calculated y for various integral values of x. For fractional values of x we need 
to use either tables or a calculator. We will now consider the inverse problem, i.e. 
given y, what is x? 


Example If 10* = 1000, what is the value of x? 

The solution is easy in this case, for we know that 10? — 1000; hence x — 3. 
The required process is to transform the equation in such a way that both sides have 
powers to the same base; thus 

10* = 10? 


Hence, by comparing exponents, we have x — 3. 
Example If 10* — 100000, what is x? 


Step 1: We write both sides as powers to the same base, i.e. 10* = 10°. 
Step 2: We compare exponents: x — 5. 


In both examples we required the exponent to the base 10 which yields a given 
value. This exponent has been given a name; it is called a logarithm. 
The following statements are equivalent: 
x is the exponent to the base 10 which gives the number 100000. 
x is the logarithm of the number 100000. 
This last statement is written as 


x = log 100000 = 5 


In order to avoid any doubts about the base, it is common practice to specify it by 
a subscript. Thus 
х = 1050 100000 = 5 


Example If 2* = 64, what is x? 
We write both sides as powers to the same base, 2 in this case. 


2* 226 Hence x=6 


The exponent which raises the base 2 to 64 is 6. This result can be expressed as 
follows: 
х = log, 64=6 


4.2 Logarithm, Logarithmic Function 75 


Definition Logarithm: The logarithm of a number c to a base a is the ex- 
ponent x of the power to which the base must be raised to equal 
the number c. As an equation, this definition is expressed as 


ава €) = с (4.7) 


We һауе to remember that the logarithm is an exponent. To resolve ће equation 
а^ = с we should proceed in two steps: 


Step 1: Write both sides as powers to the same base, i.e. 
a= a loa c) 


Step 2: Compare exponents 
x = log, С 


The examples we have just considered had integral exponents but in many cases 
where exponents are not integral, expressions cannot be resolved in such a simple 
manner. The case of fractional exponents will be explained in Chap. 8. 

Equations in which exponents appear are more easily dealt with by taking log- 
arithms. Taking logarithms is the transformation which consists of the two steps 
described above. 

To avoid having to write the base below the log as a subscript, which is not very 
convenient, the following notation has been adopted: 


Base 10: This base is mainly used in numerical calculations and it is written as 
Іов уо = lg or log . 


Logarithms to base 10 are also known as common logarithms. 
Base 2: This base is mainly used in data processing and information theory. 
Logarithms to the base 2 are written 


log, = ld 


Base e: Logarithms to the base e are called natural logarithms or Napierian log- 
arithms; they are frequently used in calculations relating to physical problems. 
Logarithms to the base e are written 


log, — In 
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4.2.2 Operations with Logarithms 


Operations with logarithms follow the power rules since logarithms are exponents. 
Thus, for example, the rule for multiplication is simplified to the addition of expo- 
nents, while the rule for division is simplified to subtraction of exponents, provided 
that the base is the same. 

The rules are: 


Multiplication 
log, АВ = log, A + log, B (4.8) 
The logarithm of a product is equal to the sum of the log- 
arithms of the factors. 
Division A 
log, вэ log, A—log, B (4.9) 
The logarithm of a quotient is equal to the difference of 
the logarithms of the numerator and denominator. 
Power 
log, A” = mlog, A (4.10) 
The logarithm of a number raised to a power is equal to 
the logarithm of that number multiplied by the exponent. 
Root 


log, ases - L log, A (4.11) 
The logarithm of the mth root of a number is equal to the 
logarithm of that number divided by m. 
Conversion of a logarithm to base a into a logarithm to another base b is a fairly 
straightforward operation. 
If x=log,c 
then c—a* 
Taking log to the base b on both sides gives 
log, c = log, a* = xlog,a 

Since x = log, c, it follows that 


log, c = log, с X logy a 


1 
1 = ——] 
ог log, С gza Ogp c 


4.2 Logarithm, Logarithmic Function TI 
Example We sometimes need to convert from base e to base 10 and vice versa. 


loge = 0.4343Inc 
Inc = 2.30261ogc 


4.2.3 Logarithmic Functions 


The function y = log, x is called a logarithmic function; it is equivalent to 
х=а? for a»0 


Example 
y=log,x or 2%=x 


Numerical values are given in the table below and the graph of the function is shown 
in Fig. 4.5. 


x y 
025 -2 
0.5 E 
1 0 
2 1 
4 2 
Fig. 4.5 


Figure 4.6 shows the logarithmic functions for three different bases, i.e. 10, 
2 and e. 


23 45 6 7 8 910 


Fig. 4.6 
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All logarithmic functions tend to minus infinity as x tends to zero, and they are 
all equal to zero at x = 1. Logarithmic functions are monotonic as they increase, 
and they tend to infinity as x goes to infinity. The reader should observe that the 
logarithmic function is the inverse of the exponential function. This is shown in 
Fig. 4.7 for bases 2 and e. 


Fig. 4.7 


4.3 Hyperbolic Functions and Inverse Hyperbolic Functions 


4.3.1 Hyperbolic Functions 


These functions play an important role in integration and in the solution of differen- 
tial equations. They are simple combinations of the exponential function e* and e^ * 
and are related to the hyperbola just as trigonometric (circular) functions are related 
to the circle. They are denoted by adding an ‘h’ to the abbreviations for the corre- 
sponding trigonometric functions. Graphs of the hyperbolic functions are shown in 
the following figures. 


Hyperbolic Sine Function 


This function is denoted by sinh (pronounced shine) and is defined as follows: 
: е 
sinh x = ———— (4.12) 


Figure 4.8a shows sinh x and also the functions 1/2e* and —1/2e~*; sinh x is 
obtained by adding them together. 
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cosh x 


(b) 


Fig. 4.8 


We observe that the hyperbolic sine is an odd function: it changes sign when x 
changes sign, i.e. it is symmetrical about the origin. 


Hyperbolic Cosine Function 


This function is denoted by cosh and is defined as follows: 
e 
cosh x = ————— (4.13) 


Its graph is shown in Fig. 4.8b; it is an even function. 
A chain or cable which hangs under gravity sags in accordance with the cosh 
function. The curve is called a catenary. 


Hyperbolic Tangent 


This function is denoted by tanh and is defined as follows: 


x 2x 


sinhx  e*—e- 
tanh x — = 5 
coshx ех +етх 1+е72х 


| l-e* 


(4.14) 


It is an odd function, i.e. its graph is symmetrical with respect to the origin. It 
is defined for all real values of x, and its range is |y| « 1. Its graph is shown in 
Fig. 4.9a. There are two asymptotes: у = 1 and y = —1. 
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Fig. 4.9 a Hyperbolic tangent; b Hyperbolic cotangent. Dotted line hyperbolic tangent 


Hyperbolic Cotangent 


This function is denoted by coth and is defined as follows: 


coshx Е БЕ - Бали ae. (4.15) 
sinh x ех ех 1—e ^*  tanhx 


coth x — 


It is an odd function. It is defined for all real values of x, except x = 0. Its graph 
is shown in Fig. 4.9b; and it lies above 1 and below —1. It is asymptotic to y = 1 
and y = —1. 

An examination of the graphs of the hyperbolic functions reveals that they are 
not periodic, unlike the trigonometric functions. 

There are a number of relationships between the hyperbolic functions which we 
will not discuss here. We will, however, derive an important one because of its sim- 
ilarity with the corresponding identity for trigonometric functions, i.e. 


2 


sin? х +cos? x = 1 


Now consider the corresponding hyperbolic functions. We have 


1 1 

sinh? x = —(e* —e^*)? = -(e?* –2+е 2%) 
4 4 
1 1 

cosh? x = ri +e7*)? = 166 4e 5 


By subtraction we find 


cosh? x — sinh? x = 1 (4.16) 
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4.3.2 Inverse Hyperbolic Functions 


The hyperbolic functions are monotonic except for cosh x. Hence we can form the 
inverse functions, except that for the inverse cosh x the range will be restricted to 
positive values of x. Figures 4.10 and 4.11 show the graphs of the inverse functions. 
They are formed by reflection in the bisectrix in the first quadrant. 


(a) (b) 
eee ee restricted function 
inverse function 
+++++++ original function 


Fig. 4.10 a Inverse hyperbolic sine; b inverse hyperbolic cosine 


Inverse Hyperbolic Sine 


y=sinh!x (Fig. 4.10a) 


It is defined for all real values of x. The following identity holds true: 


sinh! x = In(x + V x2 + 1) (4.17) 


Inverse Hyperbolic Cosine 


y=cosh !x (Fig. 4.10) 


It is defined for x > 1; hence y > 0. The following identity holds true: 


cosh”! x = In(x + V x2 — 1) (4.18) 
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Inverse Hyperbolic Tangent 
y-—tanh !x (Fig. 4.11) 


It is defined for |x| < 1. The following identity holds true: 


1. 1 
tanh ! x = z” T 


(4.19) 
—x 


Inverse Hyperbolic Cotangent 


y-—coth!x (Fig.4.11) 
It is defined for |x| > 1; its range consists of all real numbers except y = 0. The 


following identity holds true: 


1. 1 
со ^! x = -1n aged 


(4.20) 


2 x-1 


Fig. 4.11 
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Exercises 


If you have solved the exercises you may try to solve them again with the aid of 
computers using programs like Mathematica, Derive or Maple. 


Calculate the terms given in the next questions or give a transformation: 


1. (a) ат" (b) 273 
(с) ат (4) (0,1) 
(e) (3? (f) x-3 
(в) 109.1073. 10? (h) 33? 
2. (a) (V2) (b) ет 
(c) (In2)° (d) /5-47 
(e) (0, 5)? - (0, 5)-^ - (0, 5) (f) У8-ү3 
1 
3. (a) 16100 ОД тт 
(с) 10-1610 (d) 16106 
(e) 101810 (f) (Ig 10)'0 
4. (a) 148 (b) 140, 5 
(с) 1425 (d) (ад) 44 
(e) а3144 (f) (42) 
(g) 214а (8) 2142 
5. (а) elne (b) eln57 
(c) Ine? (d) (e НАР 
(e) (Ine)e* (f) In(e-e*) 
1 1 
6. (a) 1610 (b) gs 
(c) In(e2* . e5*) (d) -lga 
(e) 1d (4") (f) т-145 
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(a) In(a-b) (b) lg x? 

(c) 1d(4- 16) (d) ld x 
et ale 10* 

(e) In(e?* -e5*) (f) lg 13 


. Calculate the inverse functions: 


(а) y 22x —5 (b) у 2 &? +1 (c) y =In2x 


. Calculate the function of a function: 


(а) у=и?, и= (х) =х-1; Wanted: y= f(g(x)) 
у= T, uou Wanted: у = f(g(x) 
(с) y 242-1, u-vVx5-2; Wanted: y= f(g(x)) 
(4) у= 5и, u—g(x) 2x?-4; Wanted: y= f(g(1)) 
уау, uci: Wamed: y= /(80)) 


(f) y =ѕіп(и+л), u= : Wanted: у = f(g(1)) 


Chapter 5 
Differential Calculus 


5.1 Sequences and Limits 


5.1.1 The Concept of Sequence 


As a preliminary example, consider the fraction | /n. 
By giving n the values of the natural numbers 1,2,3,4,5... successively, we 
obtain the following sequence: 


These values are illustrated graphically in Fig. 5.1. 


зін 


a 
^ 
li 


noA O Ne 
a a 
^ e 
1 li 
tale Ble gie WIS pa 


Fig. 5.1 


In this example, 1/п defined the form of the sequence and to n we assigned 
the values of the natural numbers. The functional representation of the terms of 
a sequence is usually denoted by ал, so that the sequence becomes 


@1,42,43,...,4n,Gn+1,--- 


This can be abbreviated thus: {an } 
dg is the nth term of the sequence, sometimes referred to as the general term. 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 
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Example Let ад be given by 


1 
шон n(n+ 1) 
It then gives rise to the following sequence: 


1 1 1 
1x2! 2x3’ 3x4?’ 


Example Let ал be given by 
аһ = (—1)"n 


It then gives rise to the following sequence 
—1,2,—3,4,—5,6;... 


Example Let a; be given by an = aq" , where a and q are real numbers. By giving 
n the values 0,1,2,3,... the following sequence is obtained: a,aq,aq? ,aq? ,... This 
type of sequence is called a geometric progression (GP), and q is known as the 
common ratio. 

A sequence may be finite or infinite. In the case of a finite sequence, the range of 
n is limited, i.e. the sequence terminates after a certain number of terms. An infinite 
sequence has an unlimited number of terms. 


5.1.2 Limit of a Sequence 


Consider the sequence formed by an = 1/n. If we let n grow indefinitely then it 
follows that 1 /n converges to zero or tends to zero. This is expressed in the follow- 
ing way: 


1 
——0 as nee 
n 
or lim — =0 
non 


Zero denotes the limiting value of 1/n as n tends to infinity. Such a sequence is 
referred to as a null sequence. 

The sequence whose general term is a; = 1+ 1/n, on the other hand, converges 
to the value | as n increases indefinitely. In general, the limiting value of a sequence 
may be any number g. 


Definition If a sequence whose general term is an converges towards a fi- 
nite value g as n — ce, then g is called the limit of the sequence. 
This is written as lim dy = g. 


n-—oo 
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A precise mathematical definition is as follows: 

The sequence formed by the general term ад is said to converge towards the con- 
stant value g if for any preassigned positive number e, however small, it is possible 
to find a positive integer M such that 


lan —g| <= forall п> M 


If a sequence converges towards the value g it is said to be convergent. A sequence 
that does not converge is said to be divergent. 

The following examples are given without proof to illustrate convergent and di- 
vergent sequences. 


Convergent Sequences 


Example The sequence defined by an = п/л + 1 has the limiting value 1 as n — оо. 
This is because the number 1 in the denominator becomes less and less important 
as n becomes larger and larger. This sequence is illustrated in Fig. 5.2. 


n 
n On = 
" n+1 
1 
1 2 
2 3 
3 
3 4 
4 
4 5 
5 
5 6 
6 
6 7 
Fig. 5.2 


Example The sequence defined by an = 2 + (— 1/2)" has the limiting value 2 as 
п — оо, This sequence is illustrated in Fig. 5.3. 
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Fig. 5.3 
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Example A sequence of great significance is defined by а, = (1-- 1/1)". This 
sequence has a limiting value which is denoted by the letter e, named after Euler 
who discovered it. 


Definition Euler’s number: 


1 
e= lim (1+2) = 2.718281828... 


n—oo 


Example The following limit, given here without proof, will be used in Sect. 5.5.3: 


lim (e^ -1)n=1 (5.1) 
n-oo 
Divergent Sequences 

Example The sequence defined by ад = n? grows beyond all bounds as n — оо. 
This sequence is illustrated in Fig. 5.4. 


n an =n 

1 1 

2 4 

3 9 

4 16 
Fig. 5.4 


Example The sequence defined by an = 2” /п2 grows beyond all bounds as п — ce. 


Example The sequence defined by an = (—1)" n/(n + 1) has no limit. It oscillates 
between the values +1 and —1 as п — со, This sequence is illustrated in Fig. 5.5. 
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Fig. 5.5 
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5.1.3 Limit of a Function 


The concept of the limit of a sequence can be extended without difficulty to func- 
tions. Consider the function y = f(x). The independent variable x can take the 
values x1,x2,... If these values do not exceed the domain of definition of the func- 
tion f(x), then the corresponding values Ул = f (xn) form a sequence of values 
for y. 


Definition Within the domain of definition of the function y = f(x) we 
take out all possible sequences | Хус) which converge towards 
a determined fixed value xo. 
If {yn} = (f (xn)} tends to one single value, g, for all {xn}, 
then we call g the limit of the function f(x) as x — xo. 


We say that the function f (х) converges and write 


lim f(x)= g if x tends to the finite value xo 
XXQ0 


lim f(x) = = if x tends to infinity (оо) 


Х--оо 


1 

Example у = — for x — ee 

Let us assume that x takes the values 1,2,3,... successively. We then have a se- 
quence whose general term is a, = 1/n and which tends to zero as n increases 
beyond limit. But we could equally let x run through sequences such as 1, 3, 9, 27, 
... 0r 3/7,6/7,9/7,12/7,..., or indeed many other sequences of real numbers. In 
each case we shall find that y tends to zero, 
ie. y = 1/x has the limit g = 0 as x — eo. 

1 


Hence lim ——0 
Xo Х 


5.1.4 Examples for the Practical Determination of Limits 


Up to now we have not given a clear and precise procedure for obtaining limits. 
In fact, such a procedure is not readily available; but to some extent the successes 
in obtaining the limits in certain cases give rise to a procedure for achieving our 
objective in other cases. This is illustrated by the following examples. 
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2 


Example lim —— —— 
un Rupe TONES 


We know that lim 1/x — 0; hence, if we divide each summand of the fraction by 
Х--эсо 


the highest power of х, we get terms in the denominator which vanish if x — о. 


d Ч (if x £0) 
= lix 
x2+x+1 141/х--1/х2 
1 1 j 
As aoe ong and 2500 Henge BEIF ийн 


. sinx : : 
Example lim — (х in radians) 
х-0 x 


This is an important limit as we shall see later when we calculate the differential 
coefficient of the sine function (Sect. 5.5.3). 

Figure 5.6a shows a circular sector OAB of unit radius. If x is the angle be- 
tween the radii OA and OB, then it follows from the definitions of the trigonometric 
functions that BD = sinx,OD = cosx and AC = tanx. Also, x is the length of the 
arc AB. 


~. 
tan х 


(а) (b) 
Fig. 5.6 


Now consider the areas of the triangles ODB and OAC and the area of the sector 
OAB shown in Fig. 5.6b. We see that the area of the sector OAB is greater than the 
area of the triangle ODB and smaller than the area of the triangle OAC, i.e. 


Area A ODB < Area sector OAB « AreaA OAC 
sinx cosx 2 х T tanx 
2 2 2 


Dividing by sin x /2 gives 


xX 1 
cosx < —— < 
sin x COS X 
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We take the reciprocal 


1 sin x 
> — > cosx 
x 


COS X 
As x — 0,cosx — 1 and hence sin х/х lies between two expressions which both 
tend to the limit 1. It follows, therefore, that 

lu Eo (5.2) 


х-0 x 


5.2 Continuity 


If a function takes a sudden jump at x — xo it is said to be discontinuous; if, on the 
other hand, no such jump occurs then the function is said to be continuous. 
This is illustrated in Fig. 5.7a and b respectively. 


(a) Discontinuous function (b) Continuous function 


Fig. 5.7 


Definition The function у = f (x) is continuous at the point x = xo if the 
following conditions are satisfied: 
f (x) has the same limit g as x — xo whether xo is approached 
from the left or from the right on the x-axis. This limit g agrees 
with the value f'(xo) at x = хо. 


A notation for the limit approached from the right-hand side is 


lim f(x) 


x—xo+0 


When the limit is approached from the lift-hand side we write 


lim f(x) 


x—xo—0 


Hence the function f (х) is continuous at x = хо if 


lim f(x)= lim f(x)= lim f(x)= f( lim x)= f (xo) 


x—xo—0 x—xo+0 x—>xo XX 
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5.3 Series 


A series is formed by adding the terms of a sequence or progression. 
As an example, consider the sequence 


1 111 1 1 
”273747777 ий? lr 


By adding the terms we obtain the series 


1+ l + ! + : Tec І + : 
2 3 4 n r 
We should note the following: 
Sequence: @1,d2,d3,...,dn,..-,@r 


Series: a1 +42 + аз+:::4+аһ +- а 


a, is referred to as the leading term, 

an the general term, 

аг the last or end term, 

nis a variable number and assumes all values between 1 and r. Other letters such 
as i, j,k are often used to denote the variable. 


To indicate a series, the Greek letter sigma (X) is used to avoid writing down 
long and complicated expressions. 


5 
a, +42 +аз+::+а; = У an = 8, 


n=1 
5, denotes the sum of r terms. 


ў 
The notation У, ад means that n takes on all the values between 1 and r, e.g. 
n=1 


a2 = 82 


Mr 
2 

E 
! 

2 


E 
ll 
- 


Ра + аз ад = $4 


Ma 
N 

E 
! 

= 


E 
ll 
- 


The variable is completely determined by its limits (1 to r), and it does not matter 
which letter is used to represent it. 
As an example, consider the series formed by adding the squares of the natural 
numbers 
1? +22 632 4? x n? peor. 


Here the general term is 


аһ =n? 


By using the summation sign, the series is expressed as follows: 
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r 
У n? =1? 4243r? 
n=1 
If a series has an infinite number of terms, such that r — ee, then we write 


Tr оо r— oo 


3 
lim 5, = lim У, an 
n=1 


This is usually written 
5 m У an 
n=1 


Such a series is referred to as an infinite series. 

We should note, however, that, strictly speaking, this summation indicates a lim- 
iting process. r — œ means that we take г as large as we please. S for г — © is 
a limiting value of (S, }, provided it exists. 


5.3.1 Geometric Series 


The following series is called a geometric series, the sum of the geometric progres- 
sion (GP) 

a-- aq t aq? c aq? 4 aq" s 
The sum of the first r terms of this series is 


n-r-1 


$,— > aq” 
n=0 


To obtain an expression for this sum we multiply the original series by the common 
ratio g and then subtract the original series from the new one: 


5,9 =  aq+aq?+---+aq"’"!+aq" 


S, =a+aq+aq?+---+aq 
Subtracting gives 
Sq — Sr = —a + aq 
or Sr(q — 1) =a(q" — 1) 
Thus we get: 


q'—1 1-4" 
4-1 1-4 


Geometric series: S, = а for 4571 (5.3) 


This is the sum of the first r terms of the GP. 
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To obtain the sum for an infinite number of terms we need to find the limit of 5, 
as r — œ. We have to distinguish between the following two cases: 


Case 1: (4| « 1 
Here lim q” =0 
F—oo 
me = 1-4 а 
Hence 5 = jim 5» = lim ад = 15; 


Case 2: |q| > 1 
In this case q” grows beyond all bounds as r — со and the geometric series has 
no finite limit. 


5.4 Differentiation of a Function 


5.4.1 Gradient or Slope of a Line 


Definition The gradient of a line is the ratio of the rise Ay to the base line 
Ax from which this rise is achieved. 


The symbol A is the Greek letter ‘delta’ and is used here to mean the ‘difference 
between'. Hence Ax does not mean A multiplied by x but the difference between 
two values of x such as Хү and x5, i.e. Ax = x2 — x4. 

The gradient or slope is also given by the tangent of the angle of elevation o, as 
shown in Fig. 5.8. 


A 
шон tana 
Ax 
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5.4.2 Gradient of an Arbitrary Curve 


The gradient of an arbitrary curve, unlike the gradient of a line, varies from point 
to point, as can be seen by examining the curve shown in Fig. 5.9. If we could find 
the gradient at some fixed point P on the curve, then the line through P with the 
same slope is called the tangent to the curve at P, and we can use as synonyms the 
expressions ‘gradient of the curve’ and ‘slope or gradient of the tangent’. 


Fig. 5.9 


The problem now is to find an expression for the gradient of any curve at a given 
point P. 


Consider a point P on the curve 
y = f(x) shown in Fig. 5.10a and 
a neighbouring point Q. The line 
drawn through P and Q is called the 
secant, whose slope is 

Ay 


tana’ = — 


Ax 


We have 


Ay = f(x+Ax)— f(x) 
(see Fig. 5.10b). 


Fig. 5.10 
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With P fixed for the moment, let the point Q move towards P. It follows that in 
the limit, when О coincides with P, the angle o is equal to the angle о, the slope of 
the tangent to the curve at P. As Q gets nearer to P we notice that Ax tends to zero 
and, as a consequence, Ay tends to zero also; but the ratio Ay /Ax tends to a definite 
limit since the secant PQ becomes the tangent to the curve at P. Hence 


A Ax) — 
tana = a, tana’ = lim ae lim ЛЕРАХ) 
Ax—0 AX Ах-0 AX 


This is the slope of the tangent at P. 


Definition 
The fraction Аав 55 Д = BY is called 
Ax Ax 
the difference quotient . (5.4) 


Example Calculate the slope of the parabola y = x? at the point P = (1/2,1/4). 
From Fig. 5.11, the slope of the secant PQ, where Q is any other point, is 


f(x t Ax) - f(x) 
Ax 
We wish to obtain the slope of the tangent at P. We know that 
f (x-- Ax) = (x - Ax? 


Therefore the slope of the tangent at P is 


tana’ = 


_ (x +Ax)? — x? 
tana = lim 2———————— 
Ax—0 Ax 
This reduces to 
tana = lim (2x + Ax) 


2-3 


Fig. 5.11 
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As Ax — 0 we have, in the limit, 


tano = 2x 


At the point P(1/2,1/4) the slope is tana = 2x 1/2 = 1, giving a = 45°. 

It is not always true that the difference quotient has a limit as Ax — 0 because 
not every curve f(x) has a well-defined slope at a particular point. For example, 
consider the point P on the curve shown in Fig. 5.12. 


Fig. 5.12 


5.4.3 Derivative of a Function 


Moving from the geometrical concept above to the general case, we consider the 
difference quotient of a function f(x), namely 


Ay _ f(x+Ax)— f(x) 
Ax Ax 


Definition If the difference quotient Ay /Ax has a limit as Ax — 0, this 
limit is called the derivative or differential coefficient of the 
function y = f(x) with respect to x and we write 

dy Ay 


dx — AH Ax (om) 


This differential coefficient is denoted by у’, f'(x) or dy/dx. It must be clearly 
understood that the d does not multiply y or x but is the symbol for the differential 
of y or x; dy/dx is read as ‘dy by dx’. 

Using the above notations, we have 

dy d Ay 2 f(x Ax) – f(x) 


(х) = = = — = lim == = 1 
y =f) dx ax? (0 AXDO AX Axo Ax 
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We have thus defined analytically the first derivative by a limiting process which we 
can also interpret geometrically as the slope of the tangent to the curve at a point x, 
as shown in Fig. 5.13. 


Fig. 5.13 


By this process we have obtained substantially more than we had postulated; 
instead of obtaining the slope at some fixed point P, we have, in fact, obtained the 
slope as a function of the independent variable x. 

The importance of the differential calculus lies in the fact that it describes rela- 
tionships between variable entities. The differential coefficient y' gives the rate of 
change of y with respect to x. In the next section we will look at an example taken 
from the physics of motion. 


5.4.4 Physical Application: Velocity 


The vehicle shown in Fig. 5.14 is observed to cover a distance Ax in a time At, i.e. 
we start the clock at some time ¢ and stop it at a time ¢ + At. The magnitude of the 
average velocity, the rate of change of displacement with time, is given by 


_ Ax 
EV 


This expression gives us an average value only: it does not tell us how fast the 
vehicle is moving at a particular instant in time, i.e. we do not know its instantaneous 
velocity v(t). 

The smaller we take At, and hence Ax, the closer we get to the value of the 
instantaneous velocity at a particular time. Figure 5.15 shows the vehicle travelling 
a shorter distance Ax which it will cover in a shorter interval of time At. 

We now define the instantaneous velocity as the first derivative of the position 
coordinate x with respect to time: 


vo 


(0) = lim & 

V = vue Ar 
dx 

t= — = 

or v(t) PT 
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At 
Fig. 5.14 Fig. 5.15 


The ‘dot’ notation is frequently used when calculating derivatives with respect to 
time. This limiting process with At — 0 is one of the fundamental mathematical ab- 
stractions of physics. Although we are not able to measure arbitrary small times, we 
are nevertheless justified in taking limits with respect to time because we can draw 
conclusions which can be verified experimentally. The expression for the velocity at 
any instant of time ¢ has been written v(t) to imply that the velocity itself may vary 
with time. This variation, i.e. the rate of change of velocity with respect to time, is 
referred to as acceleration. This is a quantity which has a relationship with another 
measurable physical quantity — the force as shown in treatises on mechanics. 
Hence the acceleration a(t) is defined by 


(f) 1 Av dv 
а(1) = lim — = — = 
At—0 At dt 


5.4.5 The Differential 


We have defined the derivative or differential coefficient as 


dx | Ax20 Ax 


where dy /dx was not to be regarded as dy divided by dx but as the limit of the 
quotient Ay /Ax as Ax — 0. There are, however, situations where it is important to 
give separate meanings to dx and dy. 

Let us arbitrarily assume that dx is a finite quantity! dx is called the differential 
of x. Consider two points P and Q on the curve y = f (x) shown in Fig. 5.16. 

In going from P to Q along the curve, y changes by an amount Ay given by 


Ay = f (xt Ax) - f(x) 
The tangent to the curve at the point x changes by an amount 
dy = f'(x) dx 


during the same interval Ax — dx. dy is called the differential of the function 


у= f(x). 
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Fig. 5.16 


We see quite clearly that, in general, the differential of the function is not equal 
to the functional change in y, i.e. 


dy + Ay 


Thus the differential dy is an approximation for the change Ay: the smaller the 
interval Ax, the better the approximation. Hence, as soon as we are able to calculate 
the derivative y’ of a function we are also able to calculate its differential. 

The differential is used extensively as a first approximation for the change in 
the function. Geometrically, it means that the function is replaced by its tangent at 
a particular point. 


Notation 
x — independent variable 
y — dependent variable 
dx — differential of the independent variable x 
dy = differential of the dependent variable y , 
ie. ау = f'(x)dx 
dy is often replaced by d f 


5.5 Calculating Differential Coefficients 


We first demonstrate the calculation of differential coefficients for power functions. 
The calculation of the difference quotient and the limiting process is easy. 

In the following sections, we will derive some general rules for calculating dif- 
ferential coefficients. Using these rules, we will be able to treat the functions most 
often used in practical applications. 
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5.5.1 Derivatives of Power Functions; Constant Factors 


First we state the formula for obtaining the derivatives of power functions. 


If у= f(x) = х" wheren is any rational number 


then y'—nx"-! (5.6) 


The proof is given only for the special case when 7 is a positive integer. 
We start by investigating the difference quotient: 
Ay.  (x+4x)”— x” 


Ax Ax 


Expanding the term (x + Ax)" by the binomial theorem (cf. Sect. 4.1.3) gives 


Ay х" +nx™-lAx+---+(Ax)” — x" 
Ax | Ax 
nx" Ax bes (Ах)! 
Е Ах 


Factorising Ax gives 


A -1 

АТ =nx™ la D (АЖ) 
We proceed with the limiting process. Making Ах — 0 results in all terms vanishing 
except the first one: 

у! =n х1 
Example If у = x?, then n = 3. 
Applying the above rule gives 
yi Sant 6 

It can be shown that if n is a negative integer, п = —a, then y' = —ax 81) 
It can also be shown that if y = х?/4, where p and q are both integers, then 


y = 2. ҳ(р/9)-1 
q 


Hence the rule applies whether n is positive, negative or a fraction. 


Example If y = 1//x = x "2, i.e. n = —1/2, 
then y’ = —1/2x-U/2-1 = —1/2x 3? 
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The derivative of a constant vanishes: 


у(х) = с, c= constant 


The graph of this function is shown іп 
Fig. 5.17. It is parallel to the x-axis. The 
slope is zero. This obvious result is also ob- 
tained by systematic calculation: 

А 


cc d AX) TG 
> = Ax 
c—c 


= lim 
Ax—0 Ax 


5.5.2 Rules for Differentiation 


Constant Factor 


A constant factor is preserved during differentiation: 


у =cf(x) where с is a constant 
y суа) (57) 


Proof We can take out the constant c and place it in front of the limit sign, since it 
is not affected by the limiting process 


“| Chet Ax) = ef (x) 


/ 
=] 
ши Ах 
sas ажа) 0) 
Ax—0 Ax 
Hence y =cf' (x). 


Differentiation of a Sum: Sum Rule 


The derivative of the sum of several functions is the sum of the individual deriva- 
tives: 


y = u(x)  v(x) 
у= u'(x) 4 v'(x) (5.8) 
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Proof We separate the limit into a sum of limits. 
By definition, 


/ 


yh iim u(x +Ax) + v(x + Ax) — u(x) — v(x) 
Ax—0 Ax 


Provided that the limit of each function exists we can separate the two functions, 
so that 


Hence у= u'(x) - v(x). 


The rule applies equally well to the sum or difference of two functions and to the 
sum or difference of several functions. 

Generally, the derivative of the algebraic sum of n functions is the algebraic sum 
of their derivatives: 


If y = ui(x) +u2(x) +::: un(x) 


then yl =u (x) +и2 (x) +++ ип (х) 


Product of Two Functions: Product Rule 


If u(x) and v(x) are two functions, the derivative of the product is given by the 
following expression: 


(x)v(x) + u(x)v(x) (5.9) 


Proof By definition, 


u(x +Ax)v(x + Ax) – u(x)v(x) 
Ax—0 Ax 


Adding and subtracting u(x)v(x + Ax) to the numerator gives 


12877 u(x 4- Ax)v(x + Ax) — u(x)v(x) +u(x)u(x + Ax) — u(x)v(x + Ax) 
у 7 Ах-э0 Ах 


Collecting terms in such a way that difference quotients are formed gives 


А -FAx) – u(x) v(x 4- Ax) — v(x) 
TA u(x 
Y = s Ax Ax 


u(x) 


v(x +Ax)+ lim 
Ax—0 


Hence — y'-uw'(x)v(x) + v'(x)u(x) 
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Quotient of Two Functions: Quotient Rule 


If u(x) and v(x) are two functions, the derivative of the quotient is given by the 
following expression: 


u(x) 
УС wx) 
y = edv) ибх) (а) eum 


The proof follows the pattern given above and is omitted here. 


Derivative of a Function of a Function: Chain Rule 


If g(x) is a function of x and f(g) is a function of g, let y = f(g), i.e. 
у= f(g(x)) 


y is said to be a function of a function. Its derivative is obtained by differentiating 
the outer function f with respect to g (written d f /dg) and the inner function with 
respect to x and multiplying the two derivatives. 


y = f(g(x)) 
df 
= 9! 5.11 
y 458 0) (5.11) 
Proof By definition, 

d d Ag) – Ax) – 
dy df q,-4(8*48-f(s) ,, S im EETA EO) 
dg dg  A4g—0 Ag dx Ах-0 Ax 
d 
eel lim AF and ad c: lim = 
dg  Ae—0 Ag dx — Ax20 Ax 


Af А 
Before proceeding to the limit, consider the product $ z E. 
This is equal to Ay /Ax. 


d ^ ^ 
B cau c тыа 


Th = 
m dev Ae eo ME, X 
e dy _ df dg 
ВРЕ ах — dg dx 
d 
Hence y= 7 ——g' (x) 


dg 
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Example 
y =(14+x7)3 
g(x) = 1+х2 (inner function) 
f(g)-8g ? (outer function) 
d 
4 =3g? and g'(x)=2x 


y! 23g? x 2x = 6(1-- x? x 


Derivative of the Inverse Function 


If the function f (x) is differentiable in a given interval where f'(x) Z 0, then the 
inverse function f^! (x) possesses a derivative at all points in the corresponding 
interval. The following relationship holds true: 


d Й 1 

acf (xo) = [£7 (x9)] = = 
icum 

To demonstrate this, consider the graphs of f~!(x) and f (x), as shown in Fig. 5.18. 

It will be remembered from the geometrical correlation discussed in Chap. 3, 

Sect. 3.5 that these graphs are symmetrical about the bisection line. 


fx) 


Fig. 5.18 


At a point P = (xo, yo) the slope of the curve is the derivative of the function 
f (xo). We will now determine the slope. 


tano — [£^ (xo) 


/ 
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We denote by o the angle made by the tangent to the curve at P and the x-axis. The 
slope of the tangent measured with respect to the y-axis is denoted by 0. Further- 
more, the slope of the tangent to f (уо), at the position symmetrical to Р, is also 0. 
(Note: yo = f~!(x0).) 

гаад = f'(yg) 


Since a + 0 = 90°, it follows that tana = cot@ = 1/tan0. 


Hence [f-}(x)]/ = = 


This formula can be rewritten as follows: 
f'(y) is the derivative of f at the point y; remember that y = f~1(x) means 
x = f(y). Thus dx/dy = f'(y). If we insert dx /dy in the formula we obtain 
dy 1 


dx  dx/dy 


Derivative of the inverse function y = f^! (x): 


dein deo d 
a = a Fe) 
dy | | 


5.5.3 Differentiation of Fundamental Functions 


We now evaluate the differential coefficients for functions which are frequently 
used. Fortunately, we do not, in each individual case, have to carry out the limit- 
ing process for the function f (х) under consideration: 


The basic difficulty in obtaining this limit is that the numerator and the denominator 
of the difference quotient both become zero as Ax — 0, giving the expression 0/0. To 
overcome this difficulty we try to transform the difference quotient in such a way 
that the denominator does not become zero during the limiting process; this can 
only be achieved with some fundamental functions like power functions. In some 
cases (for example with sine functions and exponential functions) we are forced to 
carry out the limiting process. But in most other cases we may reduce the differ- 
ential coefficient to the known differential coefficients of other functions, using the 
differentiation rules derived in the previous section. The following brief proofs for 
a number of fundamental functions illustrate this. 


5.5 Calculating Differential Coefficients 107 


Trigonometric Functions 


y = sinx y =cosx 
y = cosx у= —sinx 
1 
24 = =1+tan? 
y anx y 23 + tan” x 
-1 
y =cotx yess = —1-cot” x (5.13) 
sinf x 


In the case of the last two derivatives we must exclude the values of x for which the 
denominator becomes zero. 


Proof Sine function 
We start with the difference quotient 


Ay _ sin(x +Ax) —sinx 


Ax | Ax 
Ay  2sin(Ax/2)cos(x + Ax/2)* 
Ax Ax 

| sin(Ax/2) 

= ~(Ax/2) cos(x +Ax/2) 


We saw in Sect. 5.1.4 (5.2) that 


sin Ax 
im = 
Ах-д0 Ax 
d in(Ax /2 A 
Hes ан gm, SA oo (s) on 


Proof Cosine function 


: л 
y =cosx = sin (5-2) 


We apply the chain rule for a function of a function with 


g(x) =х+5 f(g) =sing 


Differentiating gives 


cemere) 
y =cosgg = cos Wars 
= —sinx 


* To obtain this transformation, we use the relationships from Chap. 3, p. 67: 


sina — sin В 2 (5.25 232) 


In our case œ = x + Ax and В = x 
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The derivatives of the tan and cot functions can be obtained by applying the quotient 
rule: 


sin x , cos? x — (— sin? x) 1 
y = tanx = y. y= 2 == 2 
COS X cos* X cos* x 
COS X р —sin? x — cos? x -1 
and y —cotx = — y y= —3 = — 
sinx sin^ x sin^ x 


Example The vibration equation. A sine function with an arbitrary period has 
the form 


2 
y-sinax Тһе period is E 
a 


This can be treated as a function of a function with g(x) = ax and f(g) = sin(g). 
The derivative is obtained by means of the chain rule. First we differentiate f 
with respect to g and then g with respect to x. 


f(g) — sin(g) dL = cos(e) 
gasas 8 (х) а 


Hence, by the chain rule, 
у! =acosax 


In physics and engineering, we often have to deal with quantities which depend on 
time. Mechanical and electrical vibrations are typical examples. 

A vibration with an amplitude A and a frequency o (also referred to as circular 
frequency) is described by the equation 


x = Asin(ot) 


(When there is no possible confusion, the bracket around wt may be omitted, so that 
x = Asinot.) 
To obtain the velocity of the vibration we have to differentiate this equation with 


respect to the time f: 

(t) ах. 

v(t) 2 — =x 
dt 


Remember that the ‘dot’ above the x indicates differentiation with respect to time f. 


Hence 
v(t) = x = oAcos(ot) 


since A is a constant factor which remains unchanged during differentiation. The 
rest of the equation is identical to the equation y = sinax where a replaces о, x re- 
places f, y replaces x. 
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Inverse Trigonometric Functions 


1 
— , 
= sin X ә: 
y M Е 
-1 
y=cos x y = ee 
УТ-х2 
Ї 
у= (ап! х y= 12532 
-1 
у= соі! х y = EF (5.14) 


To prove the derivatives of the inverse trigonometric functions we use the general 
equation derived in Sect. 5.5.2, i.e. 


dy | 1 


dx  dx/dy 


Proof Derivative of the inverse sine function 


у= віп! х 
x =siny 


We differentiate with respect to y, obtaining 


d 
dy TOY = y 17 si y = 1-2 


: dy 1 
Since er Tay 
7 1 


it follows that y = — 
У1-х2 


Other proofs follow the same pattern. 


Exponential and Logarithmic Functions 


(5.15) 


According to (5.1), lim (e!/” — 1)n = 1. 
n—oo 
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This limit remains valid if we substitute any suitable sequence of numbers for n. 
If we substitute 1/n = Ax then, as n — ee, Ax — 0. Hence 


ie 8 
AEQ AES T 
dy , х 
С ty == = у= 
onsequenly 2 = у =e 


Proof Logarithmic function 


y —Inx (log to the base e) 


This function is equivalent to 


e? =e™ = х 


We now obtain the derivative of x with respect to y: 


S 
dy 
Remembering (5.12) 
dy | 1 
dx  dx/dy 


we find 


Comments on the Importance of the Exponential Function 


We notice the exponential function, y — e*, remains unchanged when differenti- 
ated, i.e. y' = y. According to our geometrical interpretation of the derivative (see 
Sect. 5.4.3), у! indicates how y changes with x. Therefore this function will play an 
important role in all fields where the rate of change of a function is closely related 
to the function itself. This is, for example, the case with natural growth and decay 
processes. 

The equation y' = y is, by the way, the first ‘differential equation’ encountered 
in this book. It is called a differential equation because it involves not only y but 
also the derivative of y. We note that the function y = ех satisfies this differential 
equation; it is said to be a solution of y’ = y. We shall use this also when we consider 
the solution of other differential equations (Chap. 10). 


Hyperbolic Functions 


The derivatives of hyperbolic functions and their inverses have a special significance 
in the evaluation of certain integrals. 
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y —sinhx y! =coshx 
y = coshx y! =sinhx 
= tanh x LES = 1 — tanh? x 
3 cosh? х 
1 
y =cothx y= = 1 —coth? x (5.16) 
sinh* x 


The proofs of the derivatives are quite straightforward. We shall concentrate on 
the derivative of sinh x. 

The derivative of the hyperbolic cosine function can be obtained in a similar 
manner. The derivatives of the hyperbolic tangent and cotangent can be obtained 
using the quotient rule. 


Proof Derivative of the hyperbolic sine 
y =sinhx = нэ —e7) 
We know the derivatives of the exponential functions, 
(e ее, (eye 


Hence 


1 
y'= 28 +e *) = cosh x 


Inverse Hyperbolic Functions 


1 
pd 1 
y =sinh x y= 
У1--х2 
1 
у —cosh Їх y = ——=— (x>!) 
х2-1 
= 1 
y = tanh rx а рет (|x| < 1) 
2 1 
узсой Їх yen (x| > 1) (5.17) 


The derivatives of tanh ^ ! x and сой! x look identical. But they do differ in their 
domain. 


Proof Derivative of the inverse hyperbolic sine function 


y= sinh”! x 
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We use the rule for inverse functions: 


dy 1 
dx ^ dx/dy 
If y= sinh ^! x 
then x — sinh y 
Thus d = сону = y1 +sinh? y = 1+ x2 
It follows that i 
А 


ye EE 


5.6 Higher Derivatives 


The differential coefficient of the function y = f (x) not only gives the slope of the 
function at a particular point but also gives the slope at every other point within the 
range for which the function f(x) is defined and for which the derivative exists. 
The differential coefficient is itself a function of x. 

This, therefore, suggests that we can differentiate the derivative f'(x) once more 
with respect to x. In this way the second derivative of y = f (x) with respect to x 
is obtained (Fig. 5.19). 


Definition The limiting value 


fan ОА) JG) 
x0 Ax 


= f'(x) = y"(x) (5.18) 


is called the second derivative of y — f (x) with respect to x. It 
is denoted by f" (x), y"(x), d/(dx) (dy)/(dx), (d? y)/(dx?), 
or (d) /(dx2) f (x). 

(92у) / (dx?) is read as ‘d-two y by dx squared' !) 


Fig. 5.19 
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This second derivative will, in general, be a function of x and we can obtain the 
third derivative of f" (x) = d? у/ах?. 
Hence, by repeated differentiation, we can obtain the 4th, 5th, ...2th derivative: 


In the same way as the first derivative gave us information about the slope of a func- 
tion f (x), the second derivative gives us information about the slope of the function 
f'(x), the third derivative about f”(x), and so on. 


Example If y = x, then y' = 1 and y" = 0. All the higher derivatives will be zero. 
Example Consider the equation for SHM (simple harmonic motion): 


x = Asinwt 


x = Aocosot (velocity) 
NC ES ; 
Ee S —Ao? sinet (acceleration) 
We note in passing that, since x = Asinct,X = —w?x, i.e. the acceleration is pro- 


portional to the displacement. (This is another example of a differential equation.) 
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5.7.1 Maximum and Minimum Values of a Function 


In Chap. 3 we showed that certain characteristic points of a function (zeros, poles 
and asymptotes) helped us to visualise its behaviour. 

We are now able to refine our knowledge of the behaviour of a function with the 
help of the first and second derivatives and to find points where the function has 
extreme values (referred to as local maxima and minima). 

In what follows we will assume that the function possesses a second derivative. 


Definition A function f(x) possesses a local maximum at a point xo if all 
the values of the function in the immediate neighbourhood of 
the point хо are less than f (xo) (see Fig. 5.20a). 

A function f (x) possesses a local minimum at a point Xo if all 
the values of the function in the immediate neighbourhood of 
the point хо are greater than / (хо) (see Fig. 5.20b). 


A necessary condition for a function f(x) to have a maximum or a minimum 
at a point x = Хо is that its first derivative f'(xo) should be zero. Conversely, is it 
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У, y 
Xo 
[9] Xo x [9] x 
(a) (b) 
Fig. 5.20 


possible to conclude that, if / (хо) = 0, the function has a minimum or a maximum 
value? 

The answer to this question is ‘No’, as can be seen from Fig. 5.21. At x = x, 
the slope f'(x1) = 0, but to the right of x, the value of the function is greater than 
at x, and to the left of x, it is less than х. Hence we have neither a maximum 
nor a minimum. Such a point is referred to as a point of inflexion with a horizontal 
tangent. 


Fig. 5.21 


At the point of inflexion shown, the curvature changes. Figure 5.21 shows a point 
of inflexion where the tangent to the curve at хү is horizontal. Such a point is also 
called a saddle point. An examination of the derived curve y' (Fig. 5.22) shows that 
the derivative decreases left of the point of inflexion and increases right of that point. 
At the point of inflexion it happens to be zero (hence the horizontal tangent) and the 
curve of у! goes through a minimum at x = x1; thus f" (x4) = 0. This holds for any 
point of inflexion with a horizontal or non-horizontal tangent. 

This example has shown that the condition f'(xo) = 0, although necessary, is not 
sufficient to determine whether the function has a minimum or maximum value at 
the pointx = хо. The value of the second derivative, f" (xo), will give us the second 
condition for a minimum or a maximum. 

Consider the slope of the function in the neighbourhood of a maximum, as shown 
in Fig. 5.23. On the left of xo it is positive and on the right of хо it is negative. 
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П 


Fig. 5.22 


Hence, in ће immediate vicinity of хо, ће slope of the function f(x) decreases 
monotonically and y” (хо) < 0 (Fig. 5.23). 

By a similar argument, if y" (хо) > 0 and у'(хо) = 0 then the function has a min- 
imum at x = xo (Fig. 5.24). 


Maximum or minimum: If /'(хо) = 0 (a necessary condition) and if, in 
addition, f"(xg) < 0, then there exists a local maximum at x = xo. 

If f" (хо) > 0 there is a local minimum. 

Point of inflexion: The condition /7(Хо) = 0 is necessary for the existence 
of a point of inflexion and f” (xo) 4 0 furnishes the sufficient criterion. 


Fig. 5.23 
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Fig. 5.24 


Procedure for the Determination of Maxima or Minima‘ 


Step 1: Calculate the first derivative f'(x). Set f'(x) = 0. Solve this equation and 
obtain its roots xo, x1, X2..., at which points the function may have a mini- 
mum or a maximum. 

Step 2: Calculate the second derivative f"(x). If f"(xg) < 0, there is a maximum 
at x = хо. If f"(xg) > 0, there is a minimum at x = хо. Furthermore, if 
f" (xo) = 0, then there may neither be a minimum nor a maximum, and, 
provided f" (xo) Æ 0, there is a point of inflexion at x = хо. 

Similar checks will have to be made for the points x1,x»,... 


Example Consider the function y — x? — 1. 
Step 1 gives y’ = 2x, so that xo = 0. 
Step 2 gives y" — 2 which is positive. 


Hence the function has a minimum at x = 0. 


Example Consider the function y = x? + 6x? — 15x + 51. 
Step 1 gives y' = 3х2 -12x — 15 = 0 

This is a quadratic equation whose roots are xo = —5,x1 = 1. 
Step 2 gives y" = 6x + 12 

Forxo=—5, у”--30-12--18. 


* This procedure is only valid if the maxima or minima are within the range of definition of 
the function. It is not valid if the maxima or minima coincide with the boundary of the range 
of definition. In order to identify such cases, it is helpful to sketch the graph of the function. 
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Hence the function has a maximum at this point. 


Forxy=1, у" =6+12 = 18. 


Hence the function has a minimum at this point. 


Verify for yourself that this function also has a point of inflexion at x = —2. 


5.7.2 Further Remarks on Points of Inflexion (Contraflexure) 


Consider the curve shown in Fig. 5.25a, de- 
fined by the equation y = f (x). As we trace 
the curve from N to N' its slope varies; it 
decreases from x4 to x2 and increases from 
X2 to xa. The curvature changes from con- 
cave downwards to concave upwards at N 
and N', respectively. The slope of the curve is 
shown in Fig. 5.25b. It can be seen that at x2 
the slope has a minimum value. The second 
derivative can be understood as the 'slope' of 
the slope. 


Figure 5.25c shows that at хү we have y" < 
0. This means that in the direction of the x- 
axis the slope decreases, while at x3 we have 
y" » 0 and the slope increases. 

It was mentioned earlier that a point such as 
P is called a point of inflexion or a point 
of contraflexure. The function y = f(x) 
possesses a point of inflexion at x = x2 if 


y" (x2) = 0 and y" (x2) z 0. 


(b) 


Fig. 5.25a,b 


(с) 


Fig. 5.25c 
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Example Suppose that the deflexion of a uniformly loaded beam, fixed at one end 
and simply supported at the other, is given by 
y-2K(3X?-2X*— X) where X=x/L 


x is the distance along the beam, K is a constant and L is the length, as shown in 
Fig. 5.26. (Note the directions of the axes.) 


Fig. 5.26 


For such a beam there is a point of inflexion at P, and to locate it the condition is 
that y" = 0. 

Differentiating gives y’ = K(9X? — 8X? — 1) and y" = K(18X —24X?). 

Putting y" = 0 gives X, = 0 and X5 = 18/24 = 0.75. 

Of course, only the point Хэ is of interest to us. We now have to check that 
y" (X2) #0: 


y" (X) = K(18—48X) 
y" (X3) = K(18—36) <0 


Thus there is, in fact, a point of inflexion at X2 = 0.75, i.e. at x2 = 0.75L, three- 
quarters of the way along the length of the beam. 


5.7.3 Curve Sketching 


Relationships between variables are frequently derived from physical laws leading 
to equations or functions. These are often difficult to visualise, so it is not easy to 
picture the way the function behaves. The difficulty can be overcome by sketching 
the curve. This is nota matter of plotting each point but of deriving a trend from par- 
ticular points, such as zeros, poles and asymptotes, as has been shown in Chap. 3. 
We now have more precise methods which enable us to find further important fea- 
tures like extreme values and points of inflexion. An example of how a curve may 
be sketched is given below. 

To sketch a curve given by y = f(x), the following steps may be taken in any 
order: 


(i) find the intersections with the x-axis (see Sect. 3.2.3); 
(ii) find the poles (see Sect. 3.2.3); 
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(11) examine the behaviour of the function as x — +% and find the asymptotes (see 
Sect. 3.2.3); 
(iv) find the range of values for x and y; 
(v) find the extreme values (i.e. maxima, minima) and points of inflexion. 


Also, in certain cases, it may be useful to look for symmetry. 


Example Let us investigate the behaviour of the function 


_ (x- 1»? 
f(x) = х2-1 
By rearranging we find 
ТН ахин зах oy x 2x 
= x2 +l | æ+ x2 41 


This shows that f (x) is the result of shifting an odd function (namely —2x/(x? + 1)) 
one unit along the positive y-axis. Remember that a function g(x) is called odd if 
g(—x) = —g(x). An odd function is symmetric with respect to the origin. f (x) is, 
therefore, symmetric with respect to the point (0, 1). 


Intersections with the x-axis 


(x— 1)? = 0 (set numerator to zero) 


giving хо = 1 (repeated) 


Note: the denominator does not vanish at that point. 


Pole Positions 


х2-1-0 (set denominator to zero) 


There are no poles for any real value of x. 


Asymptotes 


In the limit we obtain 


: А 2х 
ga) E (1- zu) нэг 


The proper fractional function (2х/(х2--1)) vanishes as x — +оо, and the line 
parallel to the x-axis f(x) = 1 is the asymptote as x tends to plus or minus infinity. 
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Range of Definition 


f (x) is a fractional rational function. It is defined for all values for which the de- 
nominator is different from zero. In this case it is the entire x-axis. 


Maxima and Minima 


The necessary condition is 


f'(x)=0=2 


It is satisfied for xj = +1 and хә = —1. 
The sufficient condition is 


f”(x) #0 


chm 3x —- x? 
Р (х) CEDE 


Since 
f"(x1) 21» 0. wehavea minimum at x; = +1 


and since 
f'"(x3)2 —1«0  wehavea maximum at x2 = —1 


The coordinates of the extreme points are 
minimum (4-1,0) 


maximum (—1,2) 


Points of Inflexion 


The necessary condition is 


3x — x? 


f" (x) Ve SD 


It is satisfied for хз = 0,x4 = 4- V3 and X5 = 24/3. 
The sufficient condition is 


f" (x) #0 
my ,9x* — 18x? +3 
он (x2+1)4 
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Since f" (x3) =12 40 
f” (x4) = —3/8 £0 
and f" (x5) = —3/8 40 


there are three points of inflexion. Their coordinates are 
(0,1);(+V3,1—1/2x УЗ) and (—V3,1+1/2x УЗ) 


Figure 5.27 shows a sketch of the function, using the information obtained above. 


Fig. 5.27 


By considering symmetry with respect to a point, we could have shortened our 
calculation. We know that there is a minimum at x = +1; therefore there must be 
a maximum at x = —1. We know too that there is a point of inflexion at x = +/3; 
therefore there must also be another one at x = —V/3. 


5.8 Applications of Differential Calculus 


We have developed a number of rules for obtaining the derivatives of various func- 
tions. We are now in a position to apply them to the solution of practical problems. 


5.8.1 Extreme Values 


Here we consider the application of the rule for calculating minimum and maximum 
values. 


Example A cylindrical tank, flat at the top and the bottom, is to be made from thin 
sheet metal. The volume is to be 4 cubic metres. We wish to know the diameter 
D and the height H of the cylinder for which the total area A of sheet metal is 
a minimum. 

The volume of the cylinder is 


VIA 
=—D*H=4 
d 4 


Hence H= E 
xD? 


122 


The area is 


Substituting for H gives 


Amp 
D p 2 
For a minimum aui i 
ho E D= 
ID 0, pai** 0 


Solving for D gives 


16 [2 
D= #) = =2х #/==1.721m 
Л Л 
Note that H = D! 


The other condition for a minimum is 


dA 
— > 0 
dD? 
It is satisfied, since 
dA 32 ipid 
dp? ps ^ 
The required area of metal will be 
3 3 
A= =D? = > x 1.7212 = 13.949 m? 


5.8.2 Increments 


5 Differential Calculus 


A useful application of differential calculus is the calculation of small increments. 
When an experiment is carried out, readings are taken and results deduced from 
them. Normally there is the possibility of some error in the measurements and it is 
then required to calculate the incremental effect on the result. This effect may be 
calculated as follows. The experimental data may be denoted by x and the result by 
y = f (x). Figure 5.28 shows a portion of a graph representing у = f (х). Consider 
the function at P and let x increase by a small amount Ax (error); the corresponding 


increment in y is Ay = f(x +Ах) — f(x). 


An approximate measure for the increment in the value of the function at Q is 
given by Ay. At P the slope is dy/dx, hence the approximate increment in the 


function is 


d 


Ayx m Ax = f'(x)Ax for small Ax 
x 


(5.19) 


The expression is called the absolute error. The relative error is 
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Tangent at P 


Fig. 5.28 


(See also Sect. 5.4.5.) 


Example Suppose a cylindrical vessel of the type encountered in Sect. 5.8.1 (i.e. 
height H — diameter D) is produced automatically. Supposing there is an error of 
2% in the dimensions of H and D, what is the resulting error in the volume of the 
vessel? 5 " 

V=—D?H=—D? 


4 4 
Differentiating gives 
dV 3 
— = nD? 
dD 4 


If AD is the error in D (and H) then 
3 5 
AV z g7? AD 


The relative error in V is 
AV AD 


—— £g 3—— 


V D 
Thus, if the dimensions vary by 2%, the volumes of the vessel may vary by up to 6%. 


5.8.3 Curvature 


Given a function y = f(x), we are often interested in calculating the radius of 
curvature of the function, e.g. in the bending of beams. 

Figure 5.29 shows a portion of the graph of the function y = f(x). P and Р” 
are two points close to each other. Draw tangents PT and P’T’, making angles y and 
V +Ay, respectively, with the x-axis. From P and Р” draw the normals to meet at the 
point C. In the limit, as P' approaches P, this point is called the center of curvature. 
The length of the normal to C is called the radius of curvature, denoted by R. 

1/R is called the curvature. Let us calculate R. We will consider the segment of 
the curve between P and Р”, the length of which is denoted by s. It is approximately 
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Fig. 5.29 


an arc of a circle of radius CP = CP’ = R. From the diagram we have 


1 Av 

RAY =A == — 

y S Or R As 

In the limit, as we take As smaller and smaller, we find 
1 dy ds 
EE d RI 
Rude dy 


We wish to relate R to y and its derivatives and can now use a relationship 
derived in Sect. 7.2 (lengths of curves) which is based on Pythagoras’ theorem 
ds? = dx? +dy?: 


ds / 
иас 1 ”ү2 
di 5552 
Using the chain rule we find 


ds ds dx dx 


u ku de tee 
We also know that 
tan y = y' 
Differentiating this expression with respect to x gives 
1 dy 
t п pm EET 
Bo cos?y dx 
dy dy 
"”-(144 2 Sei IN2 ji 
»' = (1+ tan? y) E = [1+0 28 


Substituting for dx /dy it follows that 
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Radius of curvature 


R=- (5.20) 


This is the desired expression for the radius of curvature at any point x in terms 
of the first and second derivatives of the given function. 


Example Calculate the radius of curvature of the function y = cosx when 


л 
x=-. 
4 
Differentiating twice we have 
1 : ” 
y ——sinx and у =—cosx 
substituting in the equation for R gives 
: 2 4503/2 
pe Otsin 45?) -— 
— cos 45? 


The negative sign means that the curve is concave downwards. 


5.8.4 Determination of Limits by Differentiation: L'Hópital's Rule 


The determination of limits of functions by differentiation has a special significance 
in physics and engineering. For this reason we state briefly /’H6pital’s rule. It gives 
us the values of expressions at points for which the value cannot be calculated di- 
rectly because indeterminate expressions arise. 


0 
The Indeterminate Expression 0 


О Hópital's first rule states: 
If lim f(x) = 0 апа lim g(x) = 0, then 
xXX0 X2XQ0 


UON HC 


i == 
xo (х) о (3) 


if the limit on the right-hand side exists. 
If f’ and g' at x = хо are continuous and 2'(хо) Æ 0, then 
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f(x) _ f'o) abe 


lim = 
xoxo g(x) | g'(xo) 


If lim f’(x)=Oand lim g'(x) = 0, then we apply the same rule again. 
Х-Хо 


Х--Хо 


Example 
1—cosx 


: . 0 
L = lim = 38 of the form — . 

х-0 X 0 
We differentiate numerator and denominator, so that 


L — lim BRUM which is again 2 : 
х-0 4Х 0 


We differentiate the top and bottom again: 


The Indeterminate Expression 5 
оо 


L' Hópital's second rule states: 
If lim f(x) =ccand lim g(x) = ee, then 
X—2XQ0 хэ хо 


20222200 


1 = 
xoxo g(x) | xoxo g'(x) 


if the limit on the right-hand side exists. 
If lim f'(x) = апі lim g'(x) =o, then we apply the rule once more. 
хх х-эХо 
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Example 


By the above rule we have 


Special Forms 


The expressions 0 x 09,00 — co, 1°, 00, 0 can be reduced to 0/0 or oe/co. 
Example The expression 0 x оо 


Inx : 1/х 


lim (xInx) = lim —— = lim ——— = lim (—x)=0 
Do ) x>+0 1/x  x>+0 —1/x? Saar ) 
Example The expression 0 
: : i; lim IX 
lim x!/* = lim е(х195) 2 9I Y^ = 9 1 
X— co X— oo 


5.9 Further Methods for Calculating Differential Coefficients 


We now outline some methods which are useful to know when complicated func- 
tions arise. 


5.9.1 Implicit Functions and their Derivatives 


Functions such as y = 3х2 + 5,у = sin! x,y = ae~* are referred to as explicit 


functions. Functions like x? + y? = R?, x? — 3xy? + y? = 10 where the function 
has not been solved for y are called implicit functions. In this case, y is said to be an 
implicit function of x. Similarly, we could equally say that x is an implicit function 
of y. 

It is occasionally possible to solve an implicit function for one of the variables. 
For example, the equation of a circle of radius R,x? + y? = R?, can be solved 
for y, giving y = +v R? — х2. Remember that in some cases it may be difficult or 
impossible to do this. 


Differentiation of implicit functions 
Differentiate all the terms of the equation as it stands and regard y as a func- 
tion of x; then solve for dy /dx. 
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Example Obtain the derivative dy /dx of x? + y? = R? (equation of a circle). 


Step 1: Differentiate all terms of the equation with respect to x. This is often ex- 
pressed as applying the operator d/dx to each term: 


ах? i: dx ES dx 
Step 2: Carry out the differentiation 
d 
2x +2y 2 =0 (R isa constant) 
dx 
Step 3: Solve for dy /dx 
dy х 
dx y 


Example Obtain dy/dx of x? — 3xy? + y? = 10. 


Step 1: 
d 4 d d 4 d 


—— х” – —(3 
ar" a Bx) + dx 


Step 2: Differentiate 


2dy 


=0 
dx 


dy 
3x = 3 (>? dem zem 


(Note: Treat d/dx(xy?) as a product, i.e. like d/dx (uv).) 
Step 3: Solve for dy /dx 


dy D. 223 
a аг ТУИ 
dx (3y2 — 6xy) pou 
d 27. 2 
dx | y^—2xy 


5.9.2 Logarithmic Differentiation 


Certain functions may be more easily differentiated by expressing them logarithmi- 
cally first. 


Example Differentiate у = VI+ x2- V/14- x4 
Step 1: Take logs to the base e on both sides 


1 1 
Iny = 5 in(1 +x?) + 3 In(1 +x") 
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Step 2: Differentiate the new expression with respect to x 


4 1 2x NEA 

— (in = 

dx I 21412 31434 
1dy xX 4 


- + 
ydx 1+х2 31-34 
Step 3: Solve for dy /dx 


d 4 x3 
2 утаа Vise ( A do ew ) 
X 


14-x? 31+x4 


5.10 Parametric Functions and their Derivatives 


5.10.1 Parametric Form of an Equation 


A curve in a plane Cartesian coordinate system has so far been represented by an 
equation of the form 


y=f(x) 
However, we frequently encounter variables x and y which are functions of a third 


variable, for example / or 0, which is called a parameter. We can express this in the 
following general form: 


x-—x(t) and y= y(t) 
or alternatively in the form 
x=g(t) and y-—A(t) 


For example, in order to describe the movement of a point in a plane, we can specify 
the components of the position vector r as functions of time. Then the parameter is 


the time t: 
r=(x(t), »(t)] 


As time passes, the components of the position vector vary and the point of the 
position vector moves along a curve (Fig. 5.30). 
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Example Describe the movement of a particle projected horizontally with initial 
velocity vg in a constant gravitational field (Fig. 5.31). 


Fig. 5.31 


The horizontal movement of the particle is of constant velocity vg. Thus the 
x-component is 
x(t) = vot 


The vertical movement of the particle is that of a freely falling body with a gravita- 
tional acceleration g. Thus the vertical component is given by 


-8,2 


у(1) = 2 


The position vector of the movement is 


r(t)= (vor, 0) 


Both components depend on a third variable, ће time f. To each value of the param- 
eter ¢ there corresponds a value for x and a value for y. 

Generally, when x and y are expressed as functions of a third variable, the equa- 
tion is said to be in parametric form. 

The parameter may be the time ¢, an angle @ or any other variable. In many cases 
it is possible to eliminate the parameter and to obtain the function of the curve in the 
familiar form. To do this in the case given above, we solve the equation x = x(t) 
with respect to 1: 


X — vot 


je 
vo 


Now we insert this expression for f into the equation for у = y(t): 


8 2 
= X 
2vo? 


y= 
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This is the equation of a parabola, as was to be expected from the known proper- 
ties of freely falling bodies. 


Example Describe the rotation of a point around the circumference of a circle 
(Fig. 5.32). The equation of a circle can be expressed in parametric form: 


x = Ксоѕф 
у= Rsing 


In this case R is the radius, and the parameter is the angle measured from the x-axis 


y, 


Гю. 
"» 


Fig. 5.32 


to the position vector. The position vector is 
r (t) -(Ксовф, Rsing) 


In order to obtain the equation of the circle in Cartesian coordinates, we take the 
square of both parametric equations and add them. 


x+y? = R? cos? ф + R? sin? ф = R? 
If a point rotates with uniform velocity on a circle, the angle ¢ is given by ф = ot. 


Here o is a constant which is called angular velocity (see Chap. 3). The parameter 
is now ¢ and the parametric form of these rotations is 


x(t) = Rcosot 
y(t) = Rsinwt 
The position vector scanning the circle is given by 
r(t) =(Reoswt, Rsinwt) 
Example Describe the parametric form of a straight line in a plane (Fig. 5.33). b is 


a vector pointing in the direction of the line. a is a vector from the origin of the 
coordinate system to a given point on the line. Now consider the vector 


r(A)=a+Ab 
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Fig. 5.33 


This vector r(A) scans all points on the line if the parameter À varies between 
--өө and +æ. Thus, the line is given in parametric form by 


x(A) = ax- Abx 
y(A) = ay+Aby 


If we consider straight lines in three-dimensional space, there is a third equation for 
the z-component: 


z(A) = а +АБ, 


Example Describe a helix in parametric form. Let us consider a point which moves 
on a helical curve (screw). The direction of the screw is the z-axis (Fig. 5.34). With 
one rotation, the point gains height by an amount h. 


x 


Fig. 5.34 


The coordinates of the point are easily given in parametric form if we use the 
angle Ó as the parameter: 


x = Ксов0 


y = RsinO 


5.10 Parametric Functions and their Derivatives 133 


The position vector is 


6 
0) = | Ксов0, Rsi h— 
r(0) ( cos 6, sin, x) 


These examples show that it is sometimes more relevant to the nature of a problem to 
establish the parametric form of a curve. Although, in the plane, it is often possible 
to transform the parametric form into the more familiar relationship y = f (x), this 
may sometimes be more complicated. 


5.10.2 Derivatives of Parametric Functions 


Derivative of a Position Vector 


Given a position vector in plane Cartesian coordinates in parametric form, the pa- 
rameter being the time f, 


r(t) -|x(),  »(0] = (0) +y) 


We can find the derivative of a position vector by finding its velocity. According 
to Fig. 5.35, the velocity is given by 


_ Ar |. r(E AD) — r(t) 
v А0 At AL) At 


Fig. 5.35 


The components are 


[uu Ax _ Ay\ (ах dy 
v= (im E, tim, 3) - (S. x) 


The components of the velocity are the derivatives of the x- and y-components. 
Therefore if a vector is given in parametric form, its derivative can be obtained by 
differentiating each component with respect to the parameter. 
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Example Find the parametric form of the velocity and acceleration of a particle, 
acting under gravity, which is projected horizontally (Fig. 5.31). 
We start with the known equation in parametric form: 


r(t)= (vor, -88) 


The velocity is obtained by differentiating each component with respect to the pa- 
rameter f: 


v(t) = (00, -81) 


For the horizontal component of v we obtain the constant initial velocity vg. For the 
vertical component we get the time-dependent velocity of a freely falling body. 

If we want to know the acceleration, we have to differentiate once more with 
respect to f: 


a(t)=(0, —&) 
We find there is no horizontal acceleration. But there is a vertical acceleration (due 
to gravity). 


Example Find the parametric form of the velocity and acceleration of a point ro- 
tating on a circle with radius R. 
We start with the known equation in parametric form with the parameter ¢ for 
time: 
r(t)=(Reoswt, Rsinot) 


The components of the velocity are given by the derivatives of the components with 
respect to the parameter t: 


d 
ЯГ = g (Reosot, Rsinot) 


v(t) = (—Rosinot, Косово!) 


The magnitude of v is 


v = V R29? sin? wt + R22 cos? wt = oR 
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The acceleration is given by the second derivative with respect to time: 


aie dv(t) 


dt 
Comparing with r gives 


= (-Ro?cosot, -—Ro?sinot) 


a(t) = —w?r(t) 


The acceleration has the opposite direction to r (Fig. 5.36). You can verify for your- 
self that r (1) is perpendicular to v(t) and v(t) is perpendicular to a(t). 

(Hint: The scalar products r · v and v · а must vanish in this case.) 

The magnitude of a is a = œ? R. 

Using the equation v = wR we can express the acceleration in two ways: 


v(t) 


Fig. 5.36 


The Normal Vector 


At each point on the curve there is a tangent vector. A vector perpendicular to the 
tangent vector is called a normal vector (Fig. 5.37). It is easy to find the formula of 
a normal vector: the scalar product of the tangent vector and a normal vector must 
vanish. 
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: dx dy 
G : t t t C= е —= 

iven a tangent vector Є > a ) 
Wanted: normal vector n= (nx,ny) 

d d 
Condition: scalar product = 0: un xt n y-0 
dy /dt d 

Solve for nx: пх = ui = У 


dx/dr? ^ ах 


We are free to choose лу. In Fig. 5.37 a normal vector is obtained by setting ny = 1. 


Note that in solving the equation for nx the parameter t was eliminated. Therefore, 
the result is also true for any curve given in the usual form y = f (x). In this case, 
a tangent vector is given by 


Derivative of a Curve Given in Parametric Form 


Given the parametric equations 
x — x(t) 
y -—y(t) 


we wish to find the differential coefficient dy/dx, i.e. the slope of the curve. We 
proceed as follows. 


! 


Step 1: Differentiate the equations for x and y with respect to the parameter to 


obtain d 4 

de and T 

Step 2: Rearrange to obtain the desired derivative: 
dy  dy/dt 
dx dx/dr 


This is the slope у’ of the function y = f (x) at the point (x, y). Note that we did 
not establish the function y = f (x) to find its derivative. 
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Example Find the parametric form of the equation of a circle. The parameter is 
denoted by 0 this time: 


x = Ксо80 
y = Rsin0 


Step 1: Differentiate x and y with respect to the parameter 0: 


d d 
dg =~ Rsind =—y, Gp = Reso = х 
Step 2: Obtain the derivative of у with respect to x: 
d d 
Z= iu. ed me 
dx 4х/40 y 


Example The cycloid is a curve traced out by a point on the circumference of 
a wheel which rolls without slipping. It is conveniently expressed in parametric 
form. The following gives the equation for a wheel with radius a. The parameter 0 
is the angle of rotation as the wheel moves in the x-direction. Figure 5.38 shows the 
wheel at different positions. The set of parametric equations is 


x = a(0 —sin8) 
y = a(1— соѕ0) 
We wish to obtain the derivative of y with respect to x. 


Step 1: 
d d 
45 = «(1 cos6) , dg «sin 
Step 2: 
dy 4у/40  8ш0 
dx  4х/49  1—cosÓ 


Fig. 5.38 
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Appendix: Differentiation Rules 


General rules 


1. Constant factor 

2. Sum (algebraic) rule 
3. Product rule 

4. Quotient rule 


5. Chain rule 


6. Inverse functions 


Derivatives 
of fundamental functions 
1. Constant factor 


2. Power function 


3. Trigonometric functions 


4. Inverse trigonometric functions 


Function 

y= f(x) 

y=cf (x) 

y =u(x)+v(x) 

y = u(x)v(x) 
. u(x) 

тъ) 

у= f |g(x)] 

jue) 

ie. x = f (y) 
Function 
у= f(x) 


y = constant 


y=x 
y =sinx 
y = cosx 
y =tanx 
y =cotx 


5 Differential Calculus 


Derivative 
y'= f'(x) 
y'=cf'(x) 


Derivative 

y'= f'(x) 

y'=0 

y! =nx"! 

у! = cosx 

у! = —sinx 

1 1 2 

y = 5 =1+tan-x 
cos? x 
-1 

y= — = —1 — cot? x 
sin“ x 

1 

Qs 
vV1-x2 

jus e. 
vV1-x2? 

i= 1 

E 1+x2 

y= : 


Exercises 


Derivatives 
of fundamental functions 


5. Exponential function 


Logarithmic function 


6. Hyperbolic trigonometric functions 


7. Inverse hyperbolic 
trigonometric functions 


Exercises 


5.1 Sequences and Limits 


Function 
y= f(x) 
y-e* 

у= ах 

y =sinhx 

y =coshx 

y =tanhx 

y =cothx 
y= sinh х 


y= cosh-! x 
y= tanh“! x 


y= coth-! x 


Derivative 
у= f'(x) 
у! =ех 
1 
PE 
? =y 
y! = cosh x 
у! =sinhx 
1 
[m E ТИЕ 2 
y= P = 1 — (апһ х 
1 
fees PEL 2 
y NUT E сой x 
1 
у= = 
V14+x2 
1 
ГА 
= 1 
y 5 (x > 1) 
маг 
Topo 
= (10 
7 х2-1 


1. Calculate the limiting value of the following sequences for n — оо: 


n 
(a) adn = — 
n 
1 n 
(c) аһ = (-3) -1 
3 
1 
2n? -n^-cn 
n2—1 


(b) an = 


5-0 
2n 


2 
(d) dn = 2+1 
n 


(f) a, 224-27" 
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2. Calculate the following limits: 


2 
. x*-cl zd 
DE my gas 
20 x2-10x . Е 
(c) lim —— — (d) lime * 
x0 2x Х-эоо 
. vVlc-x—-Vl-x . l-cosx 
(e) lim ———— — —— (f) lim ——— 
x0 X x0 20 


5.2 Continuity 


3. (a) Is the function y = 1 + |х| continuous at the point x = 0? 
(b) Determine the points for which the following function is discontinuous: 


1 fo 2k<x<2k+1 


/e-1u не И k =0,1,2,3... 


(c) At which points is the function f (x) shown in Fig. 5.39 discontinuous? 


Fig. 5.39 
5.3 Series 
4. Obtain the values of the following sums: 
5 1 
(а) 55 = Y ЦЭ 
v=1 v 


9 154 
(b) Sio = PE (5) 


oo 1 n 
(c) What is the value of the sum S = У, G) ? 
n=0 


5.4 Differentiation of a Function; 5.5 Calculating Differential Coefficients 


5. (a) Given the curve y = x? — 2x, calculate the slope of the secant to the 

curve between the points Хү = 1 and x2 = 3/2. Compare the slope of 
the secant with that of the tangent at the point x; = 1. 

(b) The distance-time law for a particular motion is given by 
s(t) = 31? — 8t m. Evaluate the velocity at t = 3 s. 

(c) Determine the differential dy of the following functions: 

à) f(x) =x?47x 

Gi) f(x) =x -2x4 +3 
ii) f (x) = 2(x? +3) 
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6. 


Differentiate with respect to x the following expressions: 


(a) 3x5 (b) 8x —3 (c) x7/3 
(e) x?—2x 
(d) 7x3 — 4х3/2 5x? 


. Obtain the derivatives of the following: 


1 
(а) у= 2х? (b у= yx (© у= == 
= 2x —(y2 3 2124 T 
s RR aa (e) y = (x^ +2) (f) y 2x d 
3 
(p y=V14 x? w y= (a- 2) 
. Differentiate 
(a) у = 3cos(6x) (D y —4sin2zx) | (c y= Ae "sin2zx) 
(d) y 2In(x4- 1) (e) y =sinxcosx (f) y =sinx? 


(g) y = (3x? - 2? h) y=asin(bx+c) (i) y= ө2х3-4 


9. Differentiate (inverse trigonometric functions) 
(а) y — cos" (cx) (b) y = Азаа (x 4-2) 
(с) y = sin 132) (d) y =coth7!(./x) 
10. Differentiate (hyperbolic trigonometric functions) 
(а) y = Csinh(O.1x) (b) u = ztanh(v + 1) 
(с) n= In(cosh£) (d) s =In(cosht) 
(e) у = sinh? x — cosh? x (f) y 22xcothx — x? 
11. Differentiate (inverse hyperbolic trigonometric functions) 
(a) у = Asinh-! (10x) (b) u = C coth^! (v + 1) 
—1r/. i zu x—1 
(c) ņn = tanh7 (sin £) (d) y = sinh 
x 
5.6 Higher Derivatives; 5.7 Extreme Values and Point of Inflexion 
12. Obtain the following derivatives: 


(a) 2(ф) =asing + tang, required g’(@), 1st derivative 
(b) v(u) = ue", required v" (и), 2nd derivative 
(c) f(x) = ах, required f" (x), 2nd derivative 
(d) k(x) = x? +2x7, required AOV (x), 4th derivative 
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13. Find the zeros and the extreme values for the following functions: 


(а) y = 2х4 — 8х2 


(c) у — sin(0.5x) 


(e) y = 2cos($ 4- 2) 


(b) y —3sin$ 


1 
(d) y =2+45x° 


14. 


15. 


2 
(f) y= 3х5 2x? — 6х 
Points of inflexion. Show that the following functions have a point of inflex- 


ion. Calculate the value of the function at such a point. 


(a) y =x? — 9x? +24x—7 
(b y 2x*—8x? 


Curve sketching. Sketch the following functions: 


1 
(a) yeu 


4x+1 
b гэ 
(0) y 2x+3 
2 
x“ —6x+8 
(© у= x2 —6x+5 


5.8 Applications of Differential Calculus 


16. 


17. 


18. 


Errors. 

(a) A tray in the form of a cube is to be manufactured out of sheet metal. 
It is to have a cubic capacity of 0.05m?. If the tolerance on the linear 
dimensions is not to exceed 3 mm, calculate the change in the volume 
and in the area of metal as a percentage. 

(b) The height of a tower is calculated from its angles of elevation of 35? 
and 28°, observed at two points 100 m apart in a horizontal straight line 
through its base. If the measurement of the larger angle is found to have 
an error of 0.5°, what will be the error in the calculated height? 


Curvature. Calculate the radius of curvature for the following functions: 
(a) y=x3 at x=1 
(b y2=10x at x=2.5 


L'Hópital's rule 


i In(1--1 
(a) lu m (b) lim In(1 + 1/x) 
х0 x x40 1/x 
1 1 1 
: si d) li 
(c) um xt"? (9 хэрх (= ux) 
1 3 
tan| Z x| (f) lim 22 


X— oo 


yx 


(e) lim x 
x1 


. cosa — cosbO 
(g) qm Lm 03 


Exercises 143 


5.9 Further Methods 
19. Implicit functions. Obtain the derivative y' for the following expressions: 


(a) 2х2 +3у2 = 5 
(b) 3х3у2 +хсоѕу = 0 
(c) (х+у)2+2х+у=1 at х-1,у--1 


20. Logarithmic differentiation. Obtain y' for the following expressions: 
(a) y = (5x +2)(3x — 7) 
(b) y = xsinx 
(с) у= х* 
5.10 Parametric Functions and their Derivatives 


21. Parametric equations. Obtain dy /dx for the following expressions: 
(а) x=ut and y=vt—1/2gt? 
u,v and g are constants 
(b) х= a(cost +fsint) 
y = a(sint — t cost) 


22. A point rotates in the x— y plane with a radius R around the origin of the 
coordinate system with constant angular velocity. In 28 it completes 3 revo- 
lutions. Give the parametric form of the movement. 


23. (a) The parametric form of a curve is 


x(t) 2t 
y(t)=t 
z(t) 2t 


What curve is it? 
(b) What curve is described by the following equations? 


x(t) = acost 
y(t) = bsint 


24. (a) Calculate the acceleration vector 


vx (t) = —vosin wt 


vy(t) = vocosot 
(b) The position of a point in three-dimensional space is given by 


r(t) = (Rcosot, Rsinot, t) 


1 2л 
Calculate the velocity for t = — 
0) 
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(c) The acceleration of a freely falling body is 
a = (0,0,—g) 

Calculate the velocity v(t) if 


v(0) = (v0,0,0) 
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Chapter 6 
Integral Calculus 


6.1 The Primitive Function 


6.1.1 Fundamental Problem of Integral Calculus 


In Chap. 5 we started from a graph of a function which could be differentiated and 


obtained its slope or gradient. 


The problem was to find the derivative f'(x) = dy/dx of a given function 


у= f(x). 


This problem can also be reversed. Let us assume that the derivative of a function 


is known. Can we find the function? 


Example A function is known to have the 
same slope throughout its range of definition 
(see Fig. 6.1) i.e. 


yo = т 


Can we find the function? То do so we re- 
view all functions known to us to find out 
whether there is among them one which has 
a constant slope. 

In this case, we know such a function: it is 
a straight line. Hence, one possible solution 
of our problem is a straight line with a slope 
m through the origin, i.e. 


у(х) = тх 


Figure 6.2 shows such а function. The рго- 
cess of finding the function from its deriva- 
tive is called integration and the result an in- 
definite integral or primitive function. 


Fig. 6.2 


К. Weltner, №. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
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A general statement of the problem is as follows: 
Let a given function f (x) be the derivative of a function F(x) which we wish to 
find. Then F (х) has to satisfy the condition that 


F'(x) = f(x) (6.1) 


Example Let f(x) — m, a constant. We have already found a solution to be 
F(x) = тх 


We can easily verify the result by differentiating: 


F(x) = Š (mx) =m 


Remember that the derivative of a constant term is zero. Hence, if a constant term 
is added to the function F (x) just found, the function obtained will also have the 
same derivative. Therefore we can put 


F(x) =mx+C 


where C is any constant. 

It obviously follows that there is not just one solution but many others which 
differ only by constants. Thus the solution of the equation F'(x) = f (x) gives rise 
to the family of curves given by 


y=F(x)+C 


In our simple example, all straight lines with the slope m are primitive functions of 
f (x) = т, as shown in Fig. 6.3. 


Fig. 6.3 
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In order to obtain a particular primitive function from the whole family of prim- 
itive functions, we need to specify certain conditions. 

We may, for example, specify that the function must pass through a particular 
point given by a set of coordinates. Such conditions are known as boundary condi- 
tions. 

In our example, suppose we specify that when x = xo, y = yo. Substituting in 
the equation y = mx + C, we find 


yo = mxo +C 
Hence C = yo = тхо 


The final solution is 
y = тх + уо = тхо 


This is shown by the solid line in Fig. 6.3. 
All primitive functions differ from each other by a constant which can be deter- 
mined from specified boundary conditions. 


6.2 The Area Problem: The Definite Integral 


Consider the problem of calculating the area under a curve. The area F (shown 
shaded in Fig. 6.4) is bounded by the graph of the function f (x), the x-axis and the 
lines parallel to the y-axis at x = a and x = b. If f(x) is a straight line, then the 
area F is easily calculated. We now develop a method for the evaluation of F which 
is applicable to any function, provided that it is continuous in the interval a < x <b. 
For the time being, we will assume that f (x) is positive in the interval considered. 


Fig. 6.4 


We divide the interval into n sub-intervals of lengths Ax,,AÀx25,..., Ax, and se- 
lect from each sub-interval a value for the variable ху, as shown in Fig. 6.5. The 
value of the function, or height, is f (Хү). 

The area F is approximately given by the sum of the rectangles 
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Fig. 6.5 
Е x f (x1i)AXx1 + f (x2)Ax2 t M f (xi Ax; +--+ f(xn)AXn 
This sum is more compactly written 
n 
Fr ЭЭ f (xi)Axi 
i=1 


We now wish to find the limit of this sum for n increasing indefinitely. Each Ax will 
diminish indefinitely at the same time (Fig. 6.6). 


Fig. 6.6 


We know intuitively that by this process we shall obtain the exact value for the 
area F. 
Hence 


A new symbol is now introduced to denote this limiting value and write 


в- урдах 
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The symbol / is called an integral sign and the expression is called a definite in- 
tegral. It is read as ‘the integral of f(x) dx from a to b’. The integral sign is an 
elongated S and stands for ‘sum’. It should be remembered, however, that the inte- 
gral is the limit of a sum. The limiting process is valid for continuous functions. For 
discontinuous functions we have to prove in each case that a limit exists. 

If we take the greatest value of the function in each sub-interval as the height of 
the rectangles, then the sum is called an upper sum; if we take the smallest value of 
the function in each sub-interval, then the sum is called a lower sum. For continuous 
functions the upper and lower sums will coincide in the limiting process. 

The dx after the integral sign should not be left out as it is part of the process we 
have just examined. 


Definition 


b 
Е = lim У f (xi)Ax; = 1 f(x)dx (6.2) 


The symbol P f (x) dx is called the definite integral of f (x) 
between the values x — a and x — b. 


e ais called the lower limit of integration, 
* biscalled the upper limit of integration, 
e f(x) is called the integrand, 

e x is called the variable of integration. 


6.3 Fundamental Theorem 
of the Differential and Integral Calculus 


The fundamental theorem states: The area function is a primitive of the func- 
tion f(x). 

How is it arrived at? We begin with a continuous and positive function f (x) and 
consider the area below the graph of the function (shown shaded in Fig. 6.7). 
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In contrast to the previous area problem whose limits were fixed, here we con- 
sider the upper limit as a variable. It follows, therefore, that the area is no longer 
constant but is a function of the upper limit x. 

x now has two meanings: 


1. x is the upper limit of integration; 
2. х is also the variable in the function f (x). 


To avoid any difficulties that might arise because of this double meaning, we will 
change the notation and use f£ as the variable in y = f(t) and consider the area 
below the graph of this function between the fixed lower limit = a and the variable 
upper limit і = x, as shown in Fig. 6.8. 


Fig. 6.8 
The area under the curve is 
X 
F(x)= | fiat 
a 


The function F (x) defines the area below the curve of f(t) bounded by t = a and 
t = x. Figure 6.9 shows the function F(x) for the curve f (t) depicted in Fig. 6.8. 
We call the function F (x) the area function. 


F(x) 


6.3 Fundamental Theorem of the Differential and Integral Calculus 151 


We now ask ourselves the question: what is the nature of F(x)? To answer this, 
first of all consider a small increase in the upper limit x by an amount Ax. The area 
increases by the amount of the shaded strip shown in Fig. 6.10. 


Fig. 6.10 


The area function increases by AF as shown in Fig. 6.11. 
This increase in the area, AF, lies between the values /(х)Ах and f(x +Ax) 
Ах, Le. 
f(x)Ax € AF € f (x -- Ax)Ax 


Fig. 6.11 


We should note that this is valid for a monotonic increasing function. If the function 
is a decreasing monotonic function, the argument is still valid, except for a change 
in the inequalities. 

Dividing by Ax throughout gives 


FS Е < f(x+Ax) 
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Let us now consider the limiting process as Ax — 0 for the area function 
AF dF 
— = — = F' 

Ах-д0 Ax dx x) 


We also have 


Hence we find 


This means that 
F'(x) = f(x) 
The derivative of the area function F(x) is equal to f(x). In other words, the area 
function is a primitive of the function f (x). 
This is the fundamental theorem of the differential and integral calculus. It em- 
bodies the relationship between the two. 
The area function is given by 


F(x) = [Глод 


Differentiating gives 


ғо) m (f roa) = ro 


If we carry out an integration, followed by a differentiation, the operations cancel 
one another out. Thus, loosely speaking, differentiation and integration are inverse 
processes. 


The fundamental theorem of the differential and integral calculus 
X 
If ra= f (t) ш 
a 
then F'(x)= f(x) (6.3) 


So far we have not paid much attention to the choice of the lower limit a. We 
will now investigate how the area function changes if we replace the lower limit a 
by a new one, a’, as shown in Fig. 6.12. 

Let Еу (х) be the new area function. F(a’) is zero; Ё (а) corresponds to the 
area between a’ and a. If we now consider the course followed by the original area 
function F (x), we see that the new area function F; (x) is made up of two parts: 


F(a) = area between a’ and a (which is a constant) 
and F (x) = area between a and x 
Hence Fi(x) = F(x)+ Fi(a) 
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Fig. 6.13 


Thus a change in the lower limit leads to a new area function which differs from the 
original function by a constant. This is illustrated in Fig. 6.13. It is in accordance 
with the fact we found earlier that primitive functions differ from each other by 
a constant. 


6.4 The Definite Integral 
6.4.1 Calculation of Definite Integrals from Indefinite Integrals 


We will now proceed to calculate the value of the definite integral from the geomet- 
rical meaning of the primitive function as an area function. 

We require the area shown shaded below the function y — x and between the 
limits a and b in Fig. 6.14. 
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Fig. 6.14 


A primitive function which satisfies the condition F'(x) = f(x) is 


l 2 
F(x) = 55 
This can easily be verified by differentiation. 

This primitive function is the area function for the lower limit x = 0 and rep- 
resents the area below the graph and between the limits x — 0 and x. We wish to 
calculate the area for x — a to x — b. We know from Sect. 6.3 that this area is 
the difference between two areas, namely the area F (b) bounded by the graph and 
the limits x — 0,x — b, and the area F(a) bounded by the graph and the limits 
x =0,x =a (see Fig. 6.15). Hence the area required is 

A= F(b) — F(a) 
Ё З 22 a? 
In our example, this area is А = = — 5 

Having obtained the value of a definite integral by means of a particular example, 
we can now generalise the procedure. 

We obtain a primitive function F(x) for a given f(x) and then form the differ- 
ence of the values of the primitive function F (x) at the positions of the upper and 
lower limit. As a shorthand notation, a square bracket with the limits as shown is 
often used. 


Calculation of a definite integral: 
b 
f. FO) ах = [F0] = F0) – F(a) (6.4) 
a 


Primitive functions for a given f(x) differ only by an additive constant which 
cancels out when a difference is formed as above. 
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Fig. 6.15 


The definite integral is not restricted to the geometrical case of calculating area, 
as the following example shows. 


Example What is the distance covered by a vehicle in the time interval £ = 0 to 
t — 12 seconds if the velocity v is constant at 10 m/s? 
The distance is given by 


t» 
s= 1 v dt = [vt]? = v(t3 — t1) = 10x 122 120m 
f 
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6.4.2 Examples of Definite Integrals 


Integration of x? 


We wish to calculate the area under the parabola y = x? between xı = 1 and x = 2 
shown in Fig. 6.16. 


Fig. 6.16 


The area required is A = f? x?dx 


First we have to find a primitive function whose derivative is f(x) = x?; such 
a function is 
x 
Е(х) = = 
0) = 5 


We can easily verify this statement by differentiating F (x). 
The area required is then given by 


2 3 
4-1 ax = [5 
1 3 


Integration of the Cosine Function 


= 5 = й units of area 
Ep 3 8 


2 
1 


We want the area under the cosine function in the interval 0 < x < x /2 (Fig. 6.17). 
The required area is A = fọ /2 cos x dx 
From our knowledge, we are able to identify a primitive function of 
f (x) = cosx. It is 


F(x) =sinx 
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Fig. 6.17 
Hence we have 
л/2 л 
4-1 cosx dx = [sinx]g/^ = sin — sind = 1 
0 


If the area lies below the x-axis, the definite integral is negative. 
Consider the area under the x-axis in the interval 2/2 < x < 32/2 (Fig. 6.18). 


Fig. 6.18 


It is 
3л/2 32/2 3л л 
A= dx = [sinx]277? = si in~ ——1—1- —2 units of 
ёр cosx dx = [sin Х| ja sin —sin 7 units of area 

If we want to find the absolute value of an area, we must pay attention to the value of 
the function between the limits of integration, i.e. whether the function lies entirely 
above the x-axis, below it, or partly above and partly below. 

If the function is partly positive and partly negative, it is necessary to split it into 
parts, as shown in the following example and illustrated in Fig. 6.19. 

Suppose we require the absolute value of the area bounded by the function cos x 
for 0 € x < л. We proceed as follows: 
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First: Since the function is positive 
for 0 € x € 7/2, the area A, = 1 
(see above). 

Second: Since the function is negative 
for 7/2 € x < л, the area is given by 


JU 
cosx dx = [sin x] 
12 [ ]xj2 


: . Л 
ES sinx — sin 5 --1 
If this area is to be taken positive, we 
must take its absolute value, i.e. 
A5 —|-1|-21 


The total absolute area is A = Ay + 
A» = 2 units of area. 


Fig. 6.19 


Uniformly Accelerated Motion 


Example This is an example which differs from the calculation of an area and 
shows an application of integral calculus to physics. 

We first calculate the velocity of a rocket moving with a constant acceleration of 
15 m/s?, 40s after starting from rest. 

The relationship between acceleration a and velocity v is 


d t instant / 
а = — , atany instan 
dt 
The velocity is given by 
t2 
TS | a dt = [at]? = a(t — t1) = 15(40— 0) = 600m/s 
f 


In addition we wish to find the distance covered by the rocket during that time. The 
relationship between velocity and distance is v = ds /dt at any instant f. Substituting 
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t 
v= | a dt = at 
0 


in the expression 


gives 
ds 
at = — 
di 
2 (219 15 
Непсе 8-1 at dt = || = (2—12) = (402 — 0) = 12000m 
Я 21.2 2 


6.5 Methods of Integration 


We now consider general methods for determining primitive functions. The primi- 
tive function is an indefinite integral, indicated by the symbol 


| f (x) dx 


6.5.1 Principle of Verification 


The problem of obtaining the derivatives of functions can always be solved. But for 
the inverse problem, integration, a solution cannot always be found. Integration is 
a more difficult problem, and this makes the principle of verification important. We 
try to guess a solution and check whether it is a solution by differentiating, since 


ғо) = z (f rena) = re 


The function f (x) is given. We assume that F (x) is a primitive function. 

Next we test the assumption by differentiating F(x) and comparing F'(x) with 
f (x). Our assumption is valid if F'(x) = f(x). In this case, F(x) is a primitive 
function of f(x). If our assumption is incorrect, i.e. F'(x) Æ f(x), we have to 
make a new assumption and repeat the procedure until a solution is found. For this 
task we shall find tables of integrals a great help. These tables, which are universally 
available, cover a great many cases. 


6.5.2 Standard Integrals 


The integration of basic functions can be found very easily by applying the funda- 
mental theorem of differential and integral calculus, integration being the inverse 
of differentiation. Table 6.5 gives some elementary functions and their integrals; it 


160 6 Integral Calculus 


is easy to verify the results by differentiation. A more comprehensive table will be 
found in the appendix at the end of this chapter. 


Function Integral 
5 xnl 
x" (n z —1) SY +C 
sinx — cosx +C (6.5) 
cosx sinx +C 
e* ех +С 
1090) Inx|-- € 


6.5.3 Constant Factor and the Sum of Functions 


Many integrals can be simplified before carrying out the integration. It is very wise 
to carry out the initial step of simplifying integrals as it reduces the amount of work 
involved and time is therefore saved. 


Constant Factor 


If k is a constant then 


[40 ах-4 f(x) dx (6.6) 
Proof Let F(x) = f f (x) dx. Then it is true to write 


kF(x) = JE dx 
Differentiating both sides gives 


kF! = kf (x) 


Sum and Difference of Functions 


The integral of the sum of two or more functions is equal to the sum of the integrals 
of the individual functions, i.e. 


|0) во ах frat fed n 
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Proof Let F(x) be the primitive function of f(x) and G(x) that of g(x). Then 


F(x)+G(x)= [re ах+ f g(x) dx 
Differentiating both sides gives 


Ех) + G(x) = f(x) + g(x) 


For the difference of two functions it follows that 


|6) вах = f re)as- (хуйх 68) 


6.5.4 Integration by Parts: Product of Two Functions 


This follows directly from the product rule for differentiation. Let the functions be 
u(x) and v(x). If we differentiate the product uv we have 


L tuwa) = Svo) + Pate) 


dx 
or, written in a more concise way, 
(uv)! 2 u'v a uv' 


By transposing we get 
uv’ = (uv)' — u'v 


Now we integrate this equation and can hence write down the primitive function 
straight away as follows: 


n dx — uv - fu dx (6.9) 


The integral on the right-hand side is frequently much easier to evaluate than the 
one on the left-hand side. This method is particularly useful when the expression 
to be integrated contains functions such as logx and inverse functions. The right 
choice of u and v’ is decisive. 


Example / xe* dx 
Letu —x, v'—e*. 
Then wu’ = 1 and we know that v = e*. 
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Substituting in the equation gives 


fuv ax=uv- fou! ax 


n dx = хех fe dx =e*(x—1)+C 


We must not forget the constant of integration С. 
Suppose we had chosen и = ех and v' = x. 
Then v’ = e* and v = 5x*. 

Hence 


We note that the right-hand integral will be more difficult to solve than in the previ- 
ous case. 


Example 


| x?e* dx 
Let us try u = x? and v' = ех. 

Before going any further we must consider whether our choice is going to be 
favourable. We need to consider the vu’ product in f vu'dx and do a rough calcula- 
tion. 

From u — x? it follows by differentiation that 


uer 
And from v’ = e* it follows by integration that 


0 =е* 

Aside: If you аге meeting this for ће first time, you should get into the habit of 
making quick and rough calculations, and not leave the choice of u and v’ to chance. 
We know that powers of x are reduced by one when differentiated. The term e* 
remains unchanged when differentiated. If we had set u = ех and v’ = x?, then we 
would have found an increase of one in the power of x, i.e. and x?, leading to a more 
difficult integral on the right-hand side. 

Going back to our example, we have 


Jes dx =x?" —2 | xe” dx 


We have already computed the integral on the right-hand side in the previous exam- 
ple. It is 


e*(x—1)4 C' 
Thus [ 98 ах = de 2-1) co 6 (32-2х--2) +С 


We can easily verify the result by differentiating — you are advised to do so. 
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Example 
| 5e» dx = ЕЕЕ ах 


Let u = sin x and v’ = sin x 
Then u’ = cos x and v = — cos x 


n dx = -sin xcosx + | cos? x dx 


2 


Since cos? x = 1 — sin? x, then, by substituting, we find that 


| айх dx = -sinxeosx + 0 — sin? x) dx 


= —sinxcosx x | зіп? х dx 


Transposing gives 


1 
2 | sin? x dx = x—sinxcosx +C! =x —5sin2x-+C' 


Hence we obtain the final solution: 
1 1 
n dx — Z- 5 Sinxcosx +C = Z- 4 sindx +С 


With the help of integration by parts, it is often possible to simplify integrals whose 
integrands are of the nth power, since the exponent can be reduced by one each 
time. Applying the method successively may lead either to a standard integral or to 
another integrable expression. This was demonstrated in the last two examples. 

For the sake of completeness a further point should be noted. If the exponent of 
the sine function is some arbitrary number (+ 0), then the integral can be solved 
step by step. This process leads to what is known as a reduction formula. 


Example Reduction formula for f sin" x dx. 
Let u = sin"! x and v' = sin x 
Then u’ = (n — 1)sin"~? xcosx and v = —cosx 
The integral now becomes 


| алхах = -cosxsin 1 x+ (n — 1) зай 2 хсоё x dx 


Recalling the identity cos? x = 1 — sin? x, the integral on the right-hand side can be 
split into the sum of two integrals: 


| их dx =—cosxsin" -lx+ (1—1) f sin 23 dx—(n—1) | зийх 4х 


Observe that the integral to be solved now appears on both sides of the identity. 
Rearranging leads to the desired formula: 
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1 -1 
| их dx = ——cosxsin?~! x + 7 f sin"? x dx (6.10) 
n n 


Remember that this formula is valid for any exponent n Z 0. If 7 is negative then 
it is to be read from the right to the left. In the exercises, the reader will be invited 
to derive the reduction formula for f cos" x dx. 


6.5.5 Integration by Substitution 


Suppose we want to evaluate the following integral: 
тах +b) dx 


How should we proceed? 

A minute’s thought leads us to try to reduce it to a standard form, such as 
f sinu du. 
We can substitute и = ax + b 

But we still have to find a substitution for dx. To do this we differentiate и = 
ax + b with respect to x and find that 


From which dx = 1/adu 
The integral now becomes 


1 1 1 
әпи du = z [sinu du = —— cosu + C 
a a a 


To express the result in terms of the original variable x, we substitute back for 
и — ax + b and obtain 


1 
| ямах--9) dx = — 7 соз(ах +b)+C 


Substitution then enables us to reduce the function into a standard form. Further- 
more, it also often transforms a difficult or an apparently unsolvable problem into 
a solvable one. 

The method of substitution is carried out in four steps. 
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To solve Example 


EC dx | «гах 
Choice of a suitable u=g(x)= ух 
substitution which promises to 


make the problem easier. 


/ e" dx 
Substitute for 

(a) the function du 2 1 

(b) the differential dx. dx 2 /x 
In order to carry out (b), we dx =2,/x du 
differentiate the substitution, 

solve it for dx and express it in —2u du 
terms of u. peru d 


Integrate with respect to the 


u m u 22 
new variable и. ме du = 2е#(и– 1) +С 


Substitute back to express ће 
solution in terms of 
the original variable x. 


2e”(u—1)+C = 
2еУх (/x — 1)-- C 


There are no general rules for finding suitable substitutions. 

To complete the discussion on this method, let us assume that the integrand can 
be put in the form f {g(x)} as a function of a function. Then the integral we have 
to solve is 


74809) ах 


We now introduce а new variable by a substitution. 
Let u = g(x) for the inner function. 
Differentiating u with respect to x gives 


du 48  , 
di ae cs (x) 
We solve for dx to give 
1 
dx = du 
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The integral becomes 


[ео = ло) (6.11) 


This new integral must be in terms of u only. 


6.5.6 Substitution in Particular Cases 


We examine four substitutions which will reduce certain types of integrals to a stan- 
dard form. 


[flax +b) dx substitution: u=ax+b 
f(x) “7 
substitution: и = f(x) 
f(x) 
[flee (x) dx substitution: и = g(x) 
| кмах,соу x,tanx,cotx) dx substitution: и = tan > 


Integrals of the type f f (ax +b) dx 
The integral is a function of the linear function (ax + b). By letting u = ax + b, the 


integral is simplified; now du /dx = a or dx = 1/adu. 
The integral becomes 


Їл (ax+b)d =-ffwa (6.12) 


It is now much simpler than the original integral. If f (u) is a simple function its 
solution is known. We found this to be the case when we introduced the substitution 
method and considered the integral 


n +b) dx 
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To illustrate this point let us look at some further examples: 
Example 


2 
хэсгээ e 
| cos? (4x — 12) л 
Let u = 4x — 12, then du = 4dx; hence dx = 1/4du. The integral becomes 


4 3E cos? u 


This is a standard integral given in the appendix at the end of this chapter: 


= = tanu +C 


2 E 


Substituting back in terms of x gives the final solution: 


2 1 
dx = = tan(4 12 
12220 SD xs yee 


Example 


/ хүхэр 


Let u = ах +b, then dx = 1/adu 
Hence we get one of the standard integrals given in the appendix at the end of 
this chapter: 
Э du 5 


= 
Їр alte ас wr 


= 2 tan! ах) +С 


1 
Integrals of the type J 25 4 


The integrand is a fraction whose numerator is the differential coefficient of the 
denominator. 

Let u = f(x), then f'(x) dx = du 

Hence we have 


fx). dw. E 
Tol ах-/ =Inlul+C=In| f(a) +C (6.13) 


In many cases we will have first of all to put the function to be integrated in the 
above form. 
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Example 
| x+a/2 T 2x+a 
TE EE Eb Eran 
Let u = x? -- ax + b, then du = (2x +a) dx 
Hence 
1 2x +a du 1 
I C 
атр“ Е E b 
= шр +ax+b|+C 
Example 


COS X 
jose og 
a+bsinx 


Let u = a + bsinx, then du = b cos x dx or cos x dx = 1/b du 


Hence 
COS X du — 
71 C 
[Se wie myst 


= Š Inja +bsinx|+C 


Integrals of the type f f (e(x))g'(x) dx 


The integrand is a product, but what is important is the fact that the second function 
is the differential coefficient of the inner function. 
To solve the integral, let 


ца (х), g'(x)dx= du 


Hence we have 


о) в о) ах = [ riy au (6.14) 


Example 
J sin? x cos x dx 


Let u = sin x, then du = cos x dx 
Hence 


1 
| sin? ховх dx= f u?du = gee+e 
Бо 
= —sin"x 
3 


In many cases we first have to generate the form of the integrand which corresponds 
to (6.14), as the following example shows. 


6.5 Methods of Integration 
Example 


[is +tan? x + 1) dx 


169 


The integrand does not contain the factor 1/cos? x which we need in order to apply 
the method explained. We therefore expand with cos? x by using the relation 


1 
cos? x 


4 2 
4 2 (гап? x + tan^ x + 1) 
t t 1) dx = d 
nc x +tan* x 4- 1) dx | (ist та x) соёёх х 


=1+tan?x or 1-(144аа x) cos? x 


We find 


Now let u = tan x, then du = 1/cos? x dx. 
We obtain a new integral, namely 


4 2 
u` +u +1 2 1 2 / аи 
/ lcu? 5 f (v r) 4 Ї y 1+u2 


i.e. two standard forms. 
The final solution is 


1 1 
3 -Htanlu-C- gan x + tan” (tanx) +С 


1 
= Qum х+х +С 


Integrals of the type | R(sinx,cosx,tanx,cotx) dx 


The integrand is a rational expression, denoted by R, of the trigonometric functions. 
It can be transformed into a more accessible form by substitution и = tan x/2, i.e. 


x—2tan lu 
2du 


BU 


The trigonometric functions can all be expressed in terms of u. Thus 


By differentiating we get dx 


. T OX 2tan(x/2) 2u 
sinx = 2sin — cos = = = 
2 2 1+tan?(x/2) 1-и2 


1—tan?(x/2) 1-и? 
1 +(ап2(х/2) 1-2 


cosx = cos?(x/2) — sin?(x/2) = 


Я sin х 2и 
anx = = 
cox 1—u? 
cosx 1-и2 
cotx = = 


sinx 2u 


Thus the integral is transformed into one whose integrand is a function of u. 


(6.15a) 


(6.15b) 


(6.15c) 


(6.15d) 


(6.15e) 
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Example 
d 1+u? 2d d 
/ iid 2), TH 5 =) = In|u| +C =Inftan >| +C 
sinx 2u 1+и2 u 2 
Example 


dx 1 du du 
ТН ай йн 
1+sinx 1+2u/(1 +u?) 1+и2 1+2u +u? 


du =2 
=2 = С 
147 T+ tan(x/2)* 
In the table of fundamental standard integrals (at the end of this chapter) we find 


that : 
X X л 
———  =tan( = -= . 
Ї хээ: an(5 a) +e 


The reader should verify for him- or herself that these results differ by a constant 
only (= 1). 


6.5.7 Integration by Partial Fractions 


We now consider the integration of functions where the numerator and denominator 
are polynomials. In Chap. 4, Laplace-Transformations, we will use this technique 
extensively. Such functions are called fractional rational functions and have the 
following form: 


P(x) anx” as qat esca xag 
© Q(x) bmx” + by 1x71 4 E bix + bo 


where m and n are integers and an and bm Z 0. The coefficients a; and b; are real. 
If n <m, R(x) is a proper fractional rational function; in short, a proper fraction. 
If n > m, R(x) is an improper fraction, e.g. x4/x? +1. 
Any improper fraction can be transformed into a sum of a polynomial and 
a proper fraction by simple division. For example 


x4 X 


Eq 7 FI 
Our discussion is restricted to proper fractional rational functions, such as the sec- 
ond expression in the above example. 
In order to understand the expansion of such a function, we have to remember 
the fundamental theorem of algebra. This theorem states that a rational function of 
degree n, such as 


P(x) = anx” дусу pcd р 
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can be resolved into a product of factors, each of which is linear: 


Р(х) = an(x — xi)(x — x2): (x — Xn) 

а; are constant real coefficients (an # 0) . 

xj are the real or complex roots of the equation Р(х) = 0. 
Complex roots occur in pairs and are conjugate, e.g. a+ jÓ and a — jf. (Complex 
numbers are treated in detail in Chap. 9.) 
Example The cubic equation 

х3-х2-4х-4-0 
has the following roots: x1 = 1,x2 = 2,x3 = —2. 
Hence, х2 —x* —4x-+4= (x—1)(x—2)(x+2). 


Such an expansion into linear factors will help us to integrate fractional rational 
functions. Assume that the integral to be solved is of the form: 


[= ана Аар 
хт E by 4x71 +--+ Бо 


(Note that bm = 1 is easily obtained by division, and that n < m.) 

The integral is then resolved into a sum of proper partial fractions. The form of 
the partial fractions is dictated by the roots of the denominator, which we will call 
D(x). There are three cases to consider: 


Case 1: D(x) has real and unequal roots 
Case 2: D(x) has real and repeated roots 
Case 3: D(x) has complex roots 


Real and Unequal Roots 


The denominator takes on the form 


D(x) = x" + by 1x". Бо = (x — x1 )(x — x2): (X — Xm) 


3x—5 
——— d 
1 " 


The denominator has two real and unequal roots, ху = —2 and x2 = 4. 
Hence x? — 2x — 8 = (x + 2)(х — 4) and the integral becomes 


у 3x —5 а= f 3x —5 EM 

x?—2x-8 ^. J (х+2)(х-– 4) 

In this form, the integral is not easily solved. Now we will show that integrals of this 
type can be solved if we expand the integrand into partial fractions, i.e. 


Example 


3x—5 NE. х В 
(x+2)(x-4) х-42 х-4 
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A and B are constants to be determined. Multiplying both sides of the identity by 
(x + 2)(x — 4) also referred to as ‘clearing the fractions’ , yields 


3x—5- A(x —4) + В(х+2) 


As this is an identity, it must hold for all values of x. 
To calculate the values of A and B it is sufficient to insert any value for x we care 
to choose. If, in this example, we insert x = ху = —2 and then x = x2 = 4, we find 


- 
- 


х--2, 3x(-2)-5— A(-2—4); hence A= 


х-4, 3х4—5= В(4+2); hence B= 


AlN д 


The integral becomes 


Гое (6+5) Е 


11 7 
= nix +21+ cinlx - 4+6 


The expansion of a function into partial fractions is carried ош in three steps. 


1. Find the roots of the denominator and express it as the product of factors of the 
lowest possible degree. 

2. Rewrite the original integrand as the sum of partial fractions. 

3. Multiply both sides of the identity by the denominator and then calculate the 


values of the constants of the partial fractions A, B, C ,...,M by inserting suc- 
cessively the roots of the denominator, x1,x2,X3,..., Xn. 
Rule If the roots of the denominator Р(х), x1,x2,X3,..., x4, are real and 


unequal, then we set up 


N(x) A B M 
= + (ester (6.16) 
D(x) (х-х1) (x-x2) (x — Xn) 
Real and Repeated Roots 


Let us consider the integral 


dx dx 
Гез = | exc 


The denominator has roots x, = x2 —2 and x3 = —1. The roots x, and x2 are equal; 
they are called repeated roots. To every r-fold linear factor (x — x;)" of D(x) there 
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correspond r partial fractions of the form 


A1 Аз Аз Ar 


G-n) G-a) G-xS (ау 


Let us return to our example. The integrand is now 


1 СА By B> 
E 5-1 Gan) "22 


To calculate Ау, Ву and B2 we insert three particular values of x. We have, by 
clearing the fractions, 


1 = A(x — 2)? + Bi (x +1)(x—2) + Bo(x +1) 


With x =x, =2 1-43В: hence By = 


Ol Re wile 


with x=x3=-1 1=9A hence A= 
1 
with x=0 (say) 1=4A+(—2B,)+ Вә hence В! ==5 
Thus we obtain 
/ ах =;/ dx 5/ 4х «sf dx 
х3-3х-4 9J (x+1) 9J (x-2) 3J (х-–2)2 
Apart from integrals of the type f du/u, we should recognise the standard integral 


Ји" du. 
According to Sect. 6.5.5, we will have with the substitution и = x — 2: 


jd. ОР bows 11 1 
== — du = = 
| om ЭГ E uu obi oo 


The solution to our example is 


dx 1 1 
= -(In|x +1] —In|x—2 c 
EE Оа ас oy 


Rule If the denominator has a real root хо repeated r times (and some 


other distinct real roots x1,...,Xn), then the integrand takes оп 
the form 
N(x) 2 Ay A2 Ar 
D(x) х-хо (x—xo)? (х= хо)" 
В В 
+t 4 AP (6.17) 


X— X1 X — X2 X — Xn 
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Complex Roots 


If D(x) has complex roots, the method outlined above must be altered. 

For each quadratic expression we will set (Px + Q)/(x? -- ax -- b); P and Q 
are constants to be determined. 

The procedure is best illustrated by an example. 


Example 
| Dx 13x20 
dx 


x(x? — Ax 4-5) 
Again we set D(x) = x(x?—4x4-5) 20 
The roots of the denominator are x1 = 0,x2 = 2— j,x3 — 24 j, i.e. there is only 
one real root. Thus we can write 


252 –13х+20 _ A, Рх+О 
x(xX2—4x45) x х2-4х-5 


In the case of complex roots, the denominator is not split up into linear factors. 
Clearing the fractions we have 


2x? — 13x -20— A(x? — Ax - 5) + Px? + Ох 
The constants A, P and Q are calculated by inserting particular values for x. Hence 


putting x=x,;=0, 20=5A 
putting x=1, 9-24-Р-0 
putting х--1, 35-104-Р-0 


Solving for A, P and Q gives 
4-4, Р--2, 0-3 
The integral becomes 
2x? Zl3x+2 j dx 2x —3 2x —3 
=4 dx = 4Inx— | —————-d 
[we (x2 — 4x 4- 5) x2 4f | аг 4x45 7 ve ears. 2 
The remaining integral can be solved in a slightly roundabout way. In the table of 
standard integrals at the end of this chapter we find 


> = t 
ахь" Vb —a2 Vb а? 


наас сай or 
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Thus with the given denominator we can solve 
2x —4 
| Aen Р 
х2-4х--5 


This is not exactly our integral. But we can transform our given integral to match 
the pattern: 


J 2x —3 d E d | 2x —4 d J 1 d 
—>—— dx = | =— dx = x x 
х2-4х--5 х2-4х--5 x?—4x45 x?—4x45 
Both integrals are included in the table of standard integrals. So the solution of the 
second integral is: 


2x —3 
fe nno? — 44+ 5) + tan 1-2) C 


The final result of our given integral is thus: 
2x? —13x+2 
оо dx = 4Inx + In(x? —4x+5)+tan™!(x—2)+C 


This last example illustrates the fact that integration of partial fractions needs careful 
consideration and that, in the end, we may have to use the whole range of integration 
techniques. 


Rule If the denominator D(x) of the integrand has, e.g. the two conju- 
gates, complex roots x4 and x2, then the expansion into partial frac- 
tions takes on the following form: 

N(x) |. Рх-0 Ш Рх-0 
D(x) | (x — xi)(x — x2) i х2 +ax+b 
(6.18) 


6.6 Rules for Solving Definite Integrals 


The rules for solving indefinite integrals apply equally to definite integrals, e.g. 
a constant factor in the integrand can be placed in front of the integral. The inte- 
gral of a sum or difference of functions is equal to the sum or difference of the 
integrals of the individual functions, and so forth. 

We have interpreted the definite integral geometrically as the area under a curve. 
From this interpretation we can easily derive certain characteristics of the definite 
integral which are geometrically evident. 


176 6 Integral Calculus 


Expansion of an Integral Into a Sum by Subdividing the Range of Integration 


The value of the integral | h f (x) dx remains unchanged if we insert another limit 
с between the limits a and b and calculate its value not from a to b but from a to c 
and then from c to b. Thus 


M rm [fe аха ауа 
= F(c)— F(a)+ F(b) — Е(с) 
= F(b)— F(a) (6.19) 


The rule becomes evident if we consider the problem geometrically (see 
Fig. 6.20). 


f(x) 


Interchanging the Limits of Integration 


If we interchange the limits of integration the integral changes sign: 


b a 
| f(x) dx = -| f (x) dx (6.20) 


b 
Proof | f(x) dx = F(b)- F(a) = -[F(a) — F(b)] 
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Upper and Lower Limits of Integration are Equal 


If the upper and lower limits of integration are equal, the integral vanishes: 


b 
f то) а=о if a=b (6.21) 


Designation 


The value of a definite integral is independent of the designation of the variable: 


[ fo а= fle а= f’ fo du (6.22) 


The value of a definite integral depends only on its limits and not on the desig- 
nation of the variable of integration. We can use whatever designation we like for 
convenience; this is often done in physics. 


Substitution of Limits of Integration 

By means of a suitable substitution, it is often possible to transform a given integral 
into a standard one. The method of substitution discussed in Sect. 6.5.5 is applica- 
ble to definite integrals. Sometimes calculations are shortened by working out new 


limits corresponding to the new variable. The following illustrates the procedure. 


To solve Example 


[ rece [та 


Select а substitution. и = g(x) =V2x-1 
Substitute and change the uj—y2x1-1-21 


limits: uz = ү2х5- 1= 3 
lower limit и = u(a), [ Г 2 
2x — 1 dx = и du 
1 1 


upper limit u2 = и(р). 


Integrate 3 3 
5 42 1 и? du = LB = 23 
лед) аи i 3 
И] 
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6.7 Mean Value Theorem 


If f (x) is a continuous function throughout the range x = a to x = b, then 


b 
| f(x) dx = f (xo)(b — a) 


Within the interval a to b, there exists at least one value xo of x for which the area 
of the rectangle of width (b — а) and height f (хо) is equal to the area under the 
curve within that same interval (Fig. 6.21). 


Fig. 6.21 


It follows, therefore, that the value f (хо) is the mean value of the function in the 
interval considered. Hence 


b 
f (xo) = Ут = zl f (x) dx (6.23) 


In physics and engineering, we frequently find it necessary to calculate the mean 
value of a varying quantity. For example, in the case of a variable force acting against 
some resistance, the work done will depend on the mean value of that force; the 
power in an electrical network is the mean value of the product of the alternating 
current and voltage. 


Example A force applied to a body from sı = Іт to s2 = 8m is given by F = 
s? /2N. Calculate its mean value. 
If Fa = mean force in the interval s2 — s1, then 


8 
иг 1 8 5? 1 [s?] 1 
F(s) ds = ds = 1 _ — (8? — 1) = 12.2N 
21 Qu. 7 аот rb 


This force, Fm, represents the constant force applied to the body which produces 
the same amount of work in the interval as the actual force. 


Fn = 
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6.8 Improper Integrals 


Let the function y = f (x) = 1/x?, with x 4 0 (Fig. 6.22). 


Fig. 6.22 


It is required to calculate the area shown shaded in the figure within the interval 


x=atox=b. 
gere arl 1 
B a a b 


Let A be this area. Then 
There is no particular difficulty in this instance. Suppose we now extend the upper 
limit to the right (Fig. 6.23): the value of the area will increase, and if we allow b to 
grow indefinitely we find that 


b q 1 d 1 
de 3 Ч = jim (2-3) =2 or, more simply, 
"dx 1 
F= — = — 
a x? a 


Such an integral is called an 
improper integral, but its value 
can be finite. 


Fig. 6.23 


Definition Integrals with infinite limits of integration are called improper 
integrals. An improper integral is said to be convergent if its 
value is finite, and divergent if its value is infinite. 


180 6 Integral Calculus 


Integrals whose integrands tend to infinity for 
some value of the variable are also called im- 
proper, e.g. [dx/./x — 1 is improper because the 
function f(x) = 1/ух- 1 at x = 1 is infinite. 
However, the area under the curve in the interval 
x = 1 to x = 2 is finite since 


> dx azil =2 Fig. 6.24 
lh mU x= l7 (see Fig. 6.24) 


Fig. 6.24 


By the above definition we have extended to infinite limits the concept of the 
definite integral, which was originally established for finite limits. 
It should be borne in mind that not all integrals with infinite limits are convergent. 


Example 
9 dx 
a X 


, 450 


First consider ЇР dx /x where b is finite. Its value is Inb — Ina. 
If we now allow b to grow beyond all bounds, the term In b tends to infinity. 
Hence the integral has no finite value: it is an improper divergent integral, i.e. 


[5 
— — oo 
a X 


Example Work done in the gravitational field. 

If U is the work required to move a body of mass m through a given distance 
against the gravitational field produced by a body of mass M (Fig. 6.25), then, by 
Newton’s law of gravitation, the force F between the two bodies is 

mM 
Е = ү —— 
PD 


where y — universal gravitational constant, r — distance between the centers of the 
bodies. The negative sign is due to the fact that the direction of F is opposite to r. 
For a small displacement dr, the work done, dU, is 


M 
dU = Far = үт dr 
T 


Fig. 6.25 
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If the body of mass m is moved from a distance го to a distance г, the total work 
done during the displacement against the gravitational force is 


Fi M rid 1 1 
peg үгс dr—ynM | op =n (==) 
TQ r ro Г T] 


An interesting case occurs when the mass m ‘leaves’ the gravitational field, i.e. 
rı — œ. We find a convergent improper integral: 


ymM 


* dr 
v= mM | — = 
Y ro r2 ro 


6.9 Line Integrals 


As an example, we will consider the force F exerted on a body which depends on 
the position r in space (see Fig. 6.26). This could be, e.g. a gravitational force on 
a mass point or an electric force on a charged particle. We want to determine the 
work U corresponding to the body's movement along some curve from a point P, 
to a point P2. This movement can be described in parametric form 


The components of the force are 


F(r)—(Fx(r) 5,0), Fz(r)) 


"ERST 


x 


Fig. 6.26 


The curve may be thought of as being split up into n small segments. As an approx- 
imation for the work, we take the sum of all fractional amounts of work, assuming 
that the force is approximately constant along each tiny segment (see Fig. 6.27). 
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Ar; P 7 


F; (xi уь 2i) 


x 


Fig. 6.27 


The work AU; corresponding to the ith segment is determined by the scalar prod- 
uct of the force F ; and the vector pointing along the segment Ar;. 


AU; = F; -Ar; 
Thus the whole work is the sum 
ks УЕ; -Ar; 
i 
If we make the elements smaller and smaller we get, in the limiting case, the integral 
Po 
U= F(r)-dr 
P1 


This type of integral is called a /ine integral. The name is based on the fact that the 
path of integration is a curve or a line in space. Let us look at the integral in more 
detail. The force is given by 


Е —(Fx(r) Fy(r) Fz(r)) 


Now our problem is to determine an expression for the path element dr. We start 
with the expression 


r=(x(t), y(t), z(t) 
As 1 varies from 14 to 12, the position vector r moves from P, to P5. The path 
element is given by 


dx(t) dy(t) dz(t) 
dr = dt, ——— dt dt 
3 ( dE de 277 lg 


dr = (dx, dy, dz) 
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Now we calculate the line integral: 


P2 
U = F(r(t))-dr 


Py 
= [^ F(x). 90.210) ar 


t2 dx dy dz 
= F. 4-ЁЕЁ -J+F. -k — dt -i +— dt- j +— dt-k 
[P (Fale) i+ Pole) d+ Fale) 8) (Fari Zay are) 
Hence the formula for the work U reads 


5 dx dy dz 
= F. — dt + F. — dt + Е, — dt 
v = [P (Fl) Gare HONS аг Fl) ап) 


Example Let us consider the gravitational field near the Earth’s surface. It is ex- 
pressed by 
F- (0, 0, —mg) 


Consider the fairground Ferris wheel in Fig. 6.28. We want to find the work done 
during the ascent of the Ferris wheel (mass m). The path is the semicircle from P, 
to P2. Its parametric form with parameter ф is 


r=(0, Кашф, -—RcosQ) 
dr = (0, Ксо8ф, Аѕіпф) аф 


гар (0, 0, —mg)(0, Ксовф, Аѕіпф) аф 
ф-0 


- [7 етв sing) аф = [mgRcos$]g = 2mgR 


Fig. 6.28 
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Forces like gravitational or electrostatic forces are due to conservative fields. This 
means that the work done on a body depends not on the path but only on the points 
P, апар», and that it is independent of time. 

Examples of non-conservative fields are electrical fields caused by induction pro- 
cesses. 

In the case of conservative fields, the line integral can be calculated easily if the 
path is chosen in such a way that force and path are either perpendicular or parallel 
to each other. This means that the path is divided into segments which are easy to 
work with. 


Appendix 


Table of Fundamental Standard Integrals 


The constant of integration has been omitted 


fe [fea f(x) Јах 
1 ИЕ ONES ТРЕ 
c сх 2 2 tan or cot 
х2 +а а а 
п+1 1 1 
x" x (n X —1) 5 ian =! x+a 
п+1 x24+2ax+b ./Б-а2 JD a2. 
(b >a?) 
1 2x +a 2 
= In|x| (x 40) 53222023 а [х= -- ax -- b| 
e* e* 
E а“ а»0 
а ma \aż1 vax +b — y (ax +b)’ 
Inx хшх-х (x>0) : 2 ax+b 
n nx — 
vax +b a | 
: In|x —a| ! шд 
n|x — ———— sin = 
х-а 4/а2-х2 а 
1 1 2 
хэй» а2-х2 х Jaz х2 sin! Ž 
(x —a) х-а 2 2 а 
— tanh! = , 
1 1 х-а |x| < (а| 
x2-a? 2a |х+а| T odi 


орч 


185 


Appendix 
fe) — [fea fe) — f f(x)ax 
1 2 +a? 1 
In ETNA TA —sinh-! - cot ( 2) tan ( 
Vx2+a2 | 1—sinx 4 
Vx? +a2 v x2 +a? + — In(x + уа? + x2) : ant 
1+cosx 2 
1 x х2-а2 1 х 
In tv —cosh-! cot 
\/x2 — a2 a 1—cosx 2 
tan x — In|cos x| 
sinx —cosx tan? x tanx—x 
2 1 ? 1 sin2x А 
sin~ x 2 (x — sin x cos x) eum cotx In|sinx| 
1 x 2 
: In tan 5 | cot" x —cotx—x 
sinx 2 
1 г S 2 
--3 —cotx sin x xsin xc vVl-x 
sin^ x 
соѕ 1х  xcoslx- /1-x2 
cos x sinx tan! x хаа c! x —InV/1 4+. x2 
2 1 А 1 sin2x E E 5 
cos? x 5 (X t sinx cosx) = 5 x+ cot ^x xcot x-cIn/1-x 
1 х л А 
In|tan(>+)| sinh x coshx 
cos X 2 4 
1 : 
апх cosh x sinh x 
2 t h h 
cos? x 
tanh x In|cosh x| 
1 
- tan ( fuas ) coth x In| sinh x| 
1+sinx 2 4 
sinh! x — xsinh-! x — /x241 
cosh"!x  xcosh-!x— /x2—1 
tanh! x = xtanh7! x + In /1— x? 
coth-! x — xcoth-! x - In /x2—1 
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Rules and Techniques of Integration 


b с b 

11 f| лода f fdt [ 3058x 
12 pres jd 18: T ) 
4 Х x= x X — constant 

1 b i a 
1.3 ] feax--[ f (x) dx 
14 Ї f(x) dx =0 

b С b b 
L5 | feo scovs = | ДЭГЭЭ) eid di 
2.1 Integration by parts 


2.2 Integration by substitution 
The integrand is a function of a function; the inner function is taken as the new 
variable. 


b 8(5) 4и 
лв) ае ЇГ. 7005 


By substitution 
u = g(x) 
2.3 Integration by partial fractions 


Proper fractional, rational functions are expanded into the sum of partial frac- 
tions. 


/ 


P(x) - P(x) 
Q(x) шан J (x — a)(x —b)?2(x2 +сх+а) 


2 A 22 By de B5 J Cx+D 5 
= J реа x-b (x—-b) x?+cx+d 


dx 


(P (x) must be of lower order than Q(x)) 


Exercises 


6.1 The Primitive Function 
1. Find the primitives of the following functions and the value of the constant: 


(a) f(x) 23x given F(1)=2 
(b) f(x) =2x+3 given F(1)=0 
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6.4 The Definite Integral 


2. Evaluate the following definite integrals: 
z/2 л/2 л 
(а) | 3cosx dx (b) ! 3cosx dx (c) 1 3сов х dx 
0 -л/2 0 


3. Obtain the absolute values of the areas corresponding to the following inte- 
grals: 


0 2 4 
(a) [6-24 (b) i) (x — 2) dx (c) 1 (х-2)4х 


6.5 Methods of Integration 
4. Integrate and verify the result by differentiating 
24х |^ x-1 
—— +С 
e Їл (xD? x+1 i 
1 
(b) 2 [si (4x — 1) dx =x—- g sin(8x — 2) +С 


1-x? Xx 
(c) Її 1+ х2)2 anre 


5. Evaluate the following integrals by using the table of standard integrals 
given at the end of Chap. 6. 


dx 1 a 
(a) ЇТ: (Ы) UR ex (© т i 
(d) 1 sin? o do (e) | a’ dt (f) i Vx? dx 


(g) 502 as 0) ПЄВЭ = 
6. Integrate by parts the following integrals: 

(a) [xm dx 

(b) 12 cos х dx 

(c) ] 2»: dx 

(d) [cosh dx 


(e) Find the reduction formula for | cos" x dx (л #40) 


(f) Find the general formula for di x" Inx dx 
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7. Use a suitable substitution to evaluate the following integrals: 


(a) | sin(zx) dx (b) Ї 3e?*-6 dx 
(с) 152 (d) Јах) ах 
2x 
8. 2x d Dak 
(a) | сохах о ах 
x39 sinh и 
©) anes о (@ lan 


9. (a) | (sin x48 sin? x-+ sin x) cos.x dx 


E (b) | x Ww das — 1 dx 
(d) | xcosx? dx 


10. Mixed questions 


(à) [ze (b) | 5-5) dx 

(c) | «ухах 

© [x Te 
(1+ x2)tan-! x 


11. Using partial fractions, integrate the following functions: 


1 2x +3 
(a) 2—x—-x? i x(x — L)(x 4-2) 
x P x 
MU ETE а 525 
1 2 
(©) cmo х -1 
po (D x44x7241 
(g) 


(x — 1)(х2+2х+ 5) 
6.6 Rules for Solving Definite Integrals 


12. Evaluate the following definite integrals: 


2 1 
о f (x5 — 8x3 4-47) dx (b) dx 
291) 0 14x 
2 125 
(c) | sint dt (d) 3 | dt 
0 100 


13. Find the value of the absolute area between the following boundary lines: 


(а) y 2x3; x-axis; 4-2) 5-2 
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3 5 
(b y=cosx; x-axis; a= 28 b= ги 
(c) What is the value of the area between the curves y = 4x? and y = 
6x? — 2? (Hint: Sketch the graphs of both functions first. Note that for 


x = | both curves have a point in common, but do not intersect.) 


6.8 Improper Integrals; 6.9 Line Integrals 


14. Integrate the following: 


15. 


16. 


17. 


18. 


"d * dx 99 dr 
@ [ g »[ ev 
4 a E ro t 
d œ dr оо 1 
(4) 1 * (e) | 23 (f) | (1+5) dx 
-1 dx MB! i 2 
шил h | —— dx 
ef 3 Of 
A force in a conservative field is given by 
Е = (2,6,1)N 
А body is moved along the line given by 
r(t) 2 roti 
from point r (0) = ro to point r (2) = ro + 2i . Calculate the work done. 


A force in a conservative field is given by 
Е —(x,y,z)N 
A body moves from the origin of the coordinate system to the point 
Р = (5,0,0) 
Calculate the work done. 


Given the force 


Е = л У 
М2 ty? xt y? 
Evaluate the line integral along a semicircle around the origin of the coordi- 
nate system with radius R. Can you give the answer without computing? 


Given a force F = (0, —z, y). calculate the line integral along the curve 
= 2t 
r(t)= 22 cos2t, 2) 
л 


from? = 0107 = 5. 


Chapter 7 
Applications of Integration 


The purpose of this chapter is to consider some of the important applications of 
integration as applied to problems in physics and engineering. Its objective is two- 
fold. Firstly, it demonstrates the practical use of the integral calculus to readers 
who are particularly interested in applications. Secondly, other readers may use this 
chapter as a reference when practical problems are encountered. 

You will remember the calculation of areas discussed in Chap. 6 as one typical 
example. We will consider this problem again and move on to the calculation of 
volumes, lengths of curves, centroids, centers of mass, moments of inertia, and cen- 
ters of pressure, all of which are frequently encountered in practice. In line with the 
notation used in most technical books, we will now use the symbol 6 instead of A. 
Both refer to the same concept, that of a very small but finite increment. 


7.1 Areas 


By definition, the area of a plane figure is the product of two linear dimensions, e.g. 
the area A of a rectangle of width W and length L is A= WL square units. We will 
calculate areas bounded by curves. Consider the curve CC, shown in Fig. 7.1. We 
wish to calculate the area bounded by a portion PP» of the curve and the x-axis. 
P, has coordinates (x1, y1) and Pz coordinates (x2, уз). At x and x + ёх we 
erect two perpendiculars to the x-axis to meet the curve at B and B' respectively; 
the strip thus formed is a rectangle (or nearly so) whose area is approximately given 
by yóx where y is the mean height of the rectangle. As we saw in the previous 


chapter, the area lies between AB -ôx and A’B’ - ôx; however, 8х can be as small 
as we like. The total area A under the curve from x4 to x2 is the sum of all such 
rectangles, and as we take 6x smaller and smaller the area is given by the following 


definite integral: 
x 


2 
А= | ydx (7.1) 


ХІ 
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Fig. 7.1 


To evaluate it, we must know y as a function of x, i.e. y = f(x): 
A= f(x)dx = F(x2)— F(x1) 


Example Calculate the area A bounded by the parabola y = 2 + 0.5х2 and the 
x-axis between x = 1.5 and x = 3.5 to 3 decimal places. 


3.5 0.5 3.5 
A= | (2--0.5x?) dx = [x+ a 
1.5 3 1.5 


0.5 
= 2(3.5— 1.5)4 3 (3.53 — 1.53) = 10.583 square units 


Example This example is taken from a problem in thermodynamics. Figure 7.2 
shows the path corresponding to a gas as if it expands in a cylinder against a piston; 
р 18 the pressure and V the volume. 
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The work done by the gas during expansion is given by the area under the p — V 
curve. Thus the work done, 6W, in expanding from volume V to a volume V + ôV 
is given by 

ôW = póV 


where p is the mean pressure in the interval óV. The total work done in expanding 
from a pressure рі, volume Vj, to a pressure p2, volume V2, is 


V5 
W = рай (units of work, i.e. Joules in SI units) 


Vi 
To evaluate this work we need to know the expansion law relating pressure and 
volume. There are two important cases to consider: 


(a) the isothermal case in which the temperature is constant throughout the whole 
process and pV is constant; 

(b) the adiabatic case in which there is no flow of energy through the walls and pV” 
is constant. 


Case (a): pV = constant = С; hence p = б. 
The work done is 
С V2 ау 


w=] —av-c[| — 
y V y V 


y. Vo 
= [C InV]y? = Cy. = pV In — 


Case (b): pV” = C ,n > 1, e.g. n = 1.4 for air. 
Hence p = Ts and the work done is 


Vo dV Vo C 
W=C — =C V” dV = —— yi-n _ypl-n 
A yn y a 2 1 ) 
1 1- 1- 
= mae ”- Cv; 2) 


Since C is a constant, we can write С = p2 V7 = pı Vř. 


V> — 
w = R2 рїї 
1-0 


Substitution gives for the work done. 


Complementary Area 


Referring once more to Fig. 7.1, we may in particular cases wish to calculate the 
area bounded by the curve and the y-axis between у = y; and y = y2 as shown. We 
proceed as before and consider a small strip of mean length x and width ôy whose 
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area 8 А2 is xóy. The total area is 
A= [xay (7.2) 


To evaluate it, we must know the functional relationship, i.e. x = g(y). 
The area A» is often referred to as the complementary area. 


7.1.1 Areas for Parametric Functions 


Occasionally a curve is defined by parametric equations of the form 
х= f(t) and y=g(t) (cf. Chap. 5, Sect. 5.10) 


In this case, the areas are given by the following integrals: 


*2 2 dx 12 dx 
A= dx = = | t)— dt 7.3 
A. и 7а ЯГ (7.3) 
The limits їү and fz are those values of t which correspond to хү and x». 
Similarly, the complementary area: 


y2 о d t2 d 
Ai =) xdy= f x a= | f(t) dt (7.4) 
» t dt ti dt 


Example The cycloid (Fig. 7.3) is given by the equations x = a(6 — sin 0), 
y =a(1—cos@). Calculate the area between the x-axis and one arc of the curve. 
The angle turned through is 27r. 

Remember that the cycloid is a curve traced out by a point P on the circumfer- 
ence of a circle which rolls without slipping along the x-axis. It has already been 
introduced in Chap. 5 (cf. Fig. 5.38). 
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The area required is 


2ла 2л ах 
4-1 yax= | a(1— cos0) — d0 
0 0 dé 


But dx /d@ = а(1 — cos 0), so that 
2л 2л 
A=a? | (1 —со50)2а0 =a? | (1 — 2cos0 + cos? 0) 40 
0 0 


Using the table of standard integrals, appendix Chap. 6: 


in2 2л 
А =а? L 2sin 4 gom 4 = Зла? 
2 4 0 


7.1.2 Areas in Polar Coordinates 


The equation of a curve is in some cases expressed in polar coordinates (r,0),r 
being the length of the radius vector measured from the origin O and 0, the angle it 
makes with a known direction, as shown in Fig. 7.4. 

Suppose that we require the area bounded by the radii OC and OD and the curve 
CD. Consider a small sector OC'D': 


oc’ = Nie, OD’ = rid órj 


The angle between OC' and OD' is 80. 
Now we will work out an approximation for the area of OC'D'. 
Let C" cut the line OD’ so that OC" = r; and C'C" = rj60; 
The area A4, OC'C", is given by 


1 
6A = л base x height 


1 1 
- 2СС” х OC’ = 572601 


Fig. 7.4 
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An approximation for the total area A will be the sum of all such small areas, i.e. 


Now let us take 60; smaller and smaller so that n becomes very large. Then, in the 
limit, the area is given by an integral: 


alas eat yu 7.5 

= 1.257 257 (7.5) 
Example The area A of the circle may be considered as being generated by a line 
of constant length, its radius, rotating through 27 radians about its center. The area 
is then given by 


1 2% 1 2л 1 
А = 21 r?d6 = 221 40 - 3 an = лг? 


Example The curve represented by the equation r = аѕіп 30 consists of three 
loops, as shown in Fig. 7.5, lying within a circle of radius a. As 0 varies from 0 
to 2/3, the radius r traces the loop OABC. Calculate its area if a = 250mm. 
The area of one loop is A = 12:05 r? d0 = 1/247. a sin? 30 40 

Using the table of integrals we find 


in2k 
| мэх dx = - 2 = +C 


0.252 ~ 0.016m? 


0 60159 1 үл za? л 
| | 2 ( 0) = 07397 


Fig. 7.5 
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7.1.3 Areas of Closed Curves 


Let ABCD be a closed curve (see Fig. 7.6) such that it cannot be cut by any line 
parallel to the y-axis at more than two points, and all ordinates are positive. AA' 
and CC’ are tangents parallel to the y-axis and OA’ = а,ОС' = Б. 

The area A enclosed by the curve is 


b b 
4-1 BD’ ax- [ DD’ dx 
a a 


where the points B and D move along ABC and ADC respectively. Let us denote 
BD’ by f2(x) and DD’ by fi(x). 
The area is 


Fig. 7.6 


Example Calculate the area enclosed between the straight line y = 4x and the 
parabola у = 2+ х2. 
Itis wise to sketch a graph of the two functions, as shown in Fig. 7.7. 
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The required area is shown shaded. The curves cross each other at A and B, 
corresponding to x = a and x = b, respectively. We need to calculate the values of 
a and b, our limits of integration. These are given by solving the equation 


Ax 224 x? 


This is a quadratic equation whose roots are Хү = а = 0.59, x5 = b = 3.41 to 2 d p. 
Since the straight line between A and B is above the parabola, we have 


Љ(х) = 4х, f(x) 224 x? 
Hence the area is given by 


3.41 " 5 Р” 3.41 
A= (4x -2— x^) dx = |2x^ —2x-—=x = 3.77 square units 
0.59 0.59 
Let us suppose that the coordinates of a point P(x, y) on a closed curve (see Fig. 7.6) 
are given in terms of a parameter f, such that ¢ increases from 1, to t2 as we travel 
round the curve once. The point travels from A to C, via B, and from C back to A, 
via D. The equation for the area A of the closed curve becomes 


t5 d 
4-1 Е аг 
ty dt 


Example Suppose that the closed curve ABCD (Fig. 7.6) is an ellipse whose equa- 
tion is 
(1-4) (y-k? 
a? + b2 
What is the area of the ellipse? 
Let x = h — a cost and y = k + bsint. 
Then, as f varies from 0 to 27r, a point Р(х, y) goes round the curve in the direc- 
tion ABCDA. 
The area is 


-1 (л, К,а and b are constants) 


2л 2л 2л 
| (k + bsint)asint dt = ka | sint dt +ab | sin?t dt = nab 
0 0 0 


2л 
Note that the first integral = ka f sint dt = 0. 
0 


7.2 Lengths of Curves 


In this section we will derive formulae for the length of a curve. (In fact, one of 
these has already been used in Chap. 5, Sect. 5.8.3) Consider the curve defined by 
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Fig. 7.8 


the equation y = f(x), shown in Fig. 7.8, and a small portion BC, B and C being 
close to each other. 

Let ôs = length of the arc BC, BD = 6x and CD = бу, as shown by the small 
triangle BCD. 

Then the arc BC is nearly equal to the chord BC, so we may write 


(85)? ~ (chord BC)? = (8x)? + (8y)? 


Therefore 


Hence, as óx — 0,05/óx — ds/dx and dy /óx — dy /dx and 
ds/dx = y1 + (dy/dx)? and ds/dy = 4/1 + (dx/dy)? 


The total length s of the curve from A to E, corresponding to x — a and x — b, 
respectively, is 


b 2 b 
вээ (2) a= f (1 +y?) dx (1.6) 


The length is also given by 


d 
r=} (14-x2)!/2 dy 
с 


Example Let us find the length of ће circumference of a circle of radius А, which, 
of course, is well known to us. 
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The equation of a circle is 
x24 y? = К? 


Differentiating implicitly with respect to x gives 
d 
оху -0 ог уу = – 
ах 


Hence y' = —x/y = —x/A/ R? — x? and (1+ y?) = R2/(R? — x?) 


The length of the circumference 


1 
L = 4 x length of 1 circumference = 


dx 
4R 1. 
v R? — х2 
Note that to evaluate the integral we can substitute x = Rsin 0. 
Then dx = R cos 6 dð, so that 


л/2 Rcos0 do — 
L=4 ji ERR RAE. 
0 


4R d 2nR 
Rcos0 12 Deer 


Example Evaluate the length of a parabola from the origin to x — 2. 
The equation of the parabola is y = 1/4 х2. 


The required length of the curve is 


2 2 P 1/2 
f, a-»ax- f (1-2) dx 
0 0 
Les 231/2 
= 2| (4+ x^) /? dx zz 2.3 units of length 
0 


Note that the integral is of the form f уа? + x? dx which is included in the table of 
standard integrals on p. 184. 


1 2 
| a? +x? dx = zx a? x! e T In(x+ а +2) +С 


You will soon discover that evaluating lengths of curves can be very laborious; in 
fact, there are few curves whose length can be expressed by means of simple func- 
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tions. This is due to the presence of the square root. In most cases the lengths are 
calculated by approximate means. 


7.2.1 Lengths of Curves in Polar Coordinates 


Referring to Fig. 7.9, which shows a detail from Fig. 7.4, consider the small triangle 
C'D'C", We have 


C'C'" —r80, C'"D'—ór and C'D'—-ós 


Fig. 7.9 


Using Pythagoras’ theorem, (C’D’)? = (C'C")? + (C"D^)?, we can write 
(85)? = r° (80)? + (8r? 


We obtain one expression with respect to 0 and one with respect to г: 


ву ү? | 80N? 
2 — == 2 ЕЎ 
r 485) 69 or ôs 1+7 (22) ёг 


As ёг — 0,80 — 0. The length of the curve is then given by an integral: 


1/2 1/2 
02 dr \? r2 do\? 
5 = 5 ( 45) 49 ог 8 ji (i+ (7) dr (7.7) 
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Example Calculate the length of the cardioid whose equation is r = a(1 + cos 0). 
Ө varies from 0 to 27. The curve is symmetrical about the x-axis (Fig. 7.10). 
Because of symmetry, the length will be twice that given by letting 0 vary from 
0 юл. 
Since r = а(1 + соѕ0), 7/40 = —a sin 0 


Fig. 7.10 
The length is given by 
ee 24 2632 91 1/2 
L=2] l|a*(1-cos0)^ --а sin’ 6| 40 
0 


л л 0 
=2a | (2+2c0s6)"/? 40 = 4a | cos 5 d6 = 8a 
0 0 


7.3 Surface Area and Volume of a Solid of Revolution 


When a solid of revolution is generated, the boundary of the revolving figure sweeps 
out the surface of the solid. The volume of the solid depends on the area of the 
revolving figure, and the surface generated depends on the perimeter of the revolving 
figure. 
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Consider the curve AB, defined by y = f (x), and shown in Fig. 7.11 between x =a 
and x — b. 


Fig. 7.11 


Let us revolve the curve AB about the x-axis. Two figures are generated: (a) 
a surface and (b) a solid. If we consider a small strip of width óx and height y, then 
the small surface generated is given by 5A = 27ryós, where 08:18 the length of the 
curve corresponding to 6x. The total surface will be the sum of all such elements, 
i.e. surface zz X27 yds. If бх becomes smaller and smaller we have, in the limit, 


b b 38 2\ 1/2 
4-1 2лу 02 | yy i+ (2) dx (7.8) 
a a dx 


Furthermore, as the strip is rotated, it generates a thin circular slice whose volume 
ôV is approximately 
8V = лу28х 


For the whole curve, as 6x — 0, the volume of the solid generated is 
b 
Ven 1 y2dx (7.9) 
a 


Example The straight line y = mx is rotated about the x-axis, thus generating 
a right circular cone, as shown in Fig. 7.12. Calculate (a) its surface area and (b) its 
volume. (Of course, the results are well known. They are usually obtained without 
using integral calculus.) 
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y=mx, уст 


Fig. 7.12 


(a) The surface area is 


b 272 
А=2л | mxV1+m?2 dx = emen 
0 0 
= nmb? у 1+ m? = x RA/ b2 + R2 


where R is the radius of the base of the cone and m — R/b. 
We can express the surface area of the cone as 


A-—zRL, where L —slant height = yb? + R? 
(b) The volume is 
b b 
ке, лу?бх=лт? | x? dx 
0 0 
b 2 
1 1 1 R 1 
= лт? ВУ : = qnm b? = 57 ГУД = z7R°b 
Hence 
Surface area of a cone = 1/2x circumference of base x slant height. 


Volume of a cone = 1/3 xarea of base x height (1/3 of the volume of a cylinder 
having same base and height). 


Example Calculate (a) the surface area and (b) the volume of a lune of a sphere of 
radius R and thickness h (see Fig. 7.13). 

The surface will be generated by rotating the arc AB and the volume by rotating 
the area ABCD about the x-axis. 
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Fig. 7.13 


From the figure, we have 
Differentiating implicitly gives 


Thus 
2 2 2 R2 


y = y2 R2— x2 


(a) The surface area is 


b 
А=2л | (R2 — х2)1/2 
а 


b 
gum -олв| dx = 2xR(b — a) 


Hence A = 2x Rh 
(b) The volume V is 


b 2 Dodd ct 2 x 
у= | y dx =x | (R*—x ГЕЯ x-7] 
a a 3 a 
For the special case where b = R and a = 0, we have 
2 
V = спр? 
37 
This is the volume of a half sphere or hemisphere. Hence ће volume of а sphere 
is V = $nR?. 


Example Small aluminium alloy pillars having a parabolic profile are manufac- 
tured by turning down cylinders 125 mm in length and 50 mm in diameter. The di- 
ameter of the pillars at the thinner end is to be 30 mm. Calculate the amount of metal 
removed. (The density of aluminium is 2720 kg/m?) 
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Fig. 7.14 


Figure 7.14 shows the required profile of each pillar and the amount of material 
to be removed is indicated by the shading. 
The volume of material removed is 


Volume of the cylinder — volume of the pillar = л R5?h — volume АВВ”А” 


With axes as shown, the equation of the parabola AB is 
y =a+bx? 


To find the values of a and b, we note that when x = 0, y = А (= 15mm) and when 
x =h,y = Ro(= 25mm); also h = 125mm. 
Hence a = Ry and b = (R2 — R1)/h?, i.e. a = 15,b = 10/125? = 0.00064 
Rotating the element of width ôx about the x-axis gives the volume of the slice 
as лу28х; the volume of the pillar is 


h h h 
V= x | у?іх = т | (a+ bx?)?dx = т | (a? + 2abx? + b?x^) dx 
0 0 0 


2 2 
= gh («+ Зат м) 


Substituting numerical values gives 


2 000642 
V = 125л G тах 15 x 0.00064 x 125? + — x us) 


= 0.135 x 106mm? = 0.135 x 107? m? 
Volume of cylinder = z R2h = л x 25? x 125 ~ 0.245 x 105mm? 
= 0.245 x 10-3 m? 
Material removed ~ (0.245 — 0.135) x 107? x 2720 ~ 0.3kg 


If some arbitrary closed curve is rotated about the x-axis it generates a solid, i.e. 
a ring of irregular cross section (see Fig. 7.6). 

Referring to Fig. 7.6, the volume generated by the thin slice BD is a hollow 
circular plate having radii D'B = y2 and DD’ = y, and thickness бх. 
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Hence its volume V is 
8V = л (y2 ур) ôx 


The volume V of the hollow solid of revolution is 


b 
у= | (y2 — у?) dx (7.10) 
a 


where y; = fi(x) and y2 = /5(х) are the equations of the curves ADC and ABC 
respectively. 


Example Calculate the volume of a solid ring (torus or anchor ring) obtained by 
rotating a circle of radius R about an axis distant / from its center ( > R). 

The circle is conveniently positioned, as shown in Fig. 7.15, relative to the x- and 
y-axes. O' is the center of the circle. 

Consider any point P with coordinates (x, y). Consider the triangle O’A’P; we 
have (у — h)? + x? = R?. Solving for y gives y = h + VR? — х2. 

The two functions /1(х) and /5(х) are 5 (х) = А+ V R2 — x? for portion ABC 
of the circle, and f1(x) = h — v R? — x? for portion ADC of the circle. 


Fig. 7.15 


Therefore 


ôV =x (82- л?) bx = An hw R2 — x2óx 


208 7 Applications of Integration 


and the volume of the ring formed is 


R 
V= Anh | v R2 — х2 dx = 22? R?h 
-R 


Note that the integral can be solved by letting x = R sin 0 (cf. the table of integrals). 


7.4 Applications to Mechanics 


7.4.1 Basic Concepts of Mechanics 


When a rigid body moves in two or three dimensions under the action of forces it is 
the same as if the whole mass of the body was concentrated in one point with all the 
forces acting through that point, giving the body a translation in the direction of the 
resultant force. Furthermore, the body rotates about an axis through that point under 
the action of the resultant moment of the forces about that axis. The point referred 
to is called the center of mass of the body. 

If M is the total mass of the body and F the resultant force, then Newton's 
second law of motion states F = МЕ (vector equation), where # is the acceleration 
of translation. 

Also, if H is the angular momentum of the body, then 


d 
Moment of the external forces — T (H) 


It can be shown that H = Го, where I is the moment of inertia of the body about an 
axis through the center of mass and o is the angular velocity of the body. Remember 
that the moment of inertia is (the sum of) the product of a mass by the square of its 
distance from the axis of rotation. 

When studying the motion of a rigid body, e.g. a car, an aircraft, a link in a mech- 
anism, etc., we need to know the position of the center of mass and the moment of 
inertia. 

Another important point is met when studying the forces acting on a body which 
is immersed in a liquid. This point, called the center of pressure, is the point where 
the total pressure on the body is supposed to act. 

We will consider these three concepts in some detail. 


7.4.2 Center of Mass and Centroid 


Consider a system of n particles P; whose masses are т; (i = 1,2,...,7); let the 
coordinates of these particles, referred to a Cartesian set of axes x, у, 2, as shown in 
Fig. 7.16, be xj, Уг, zi. 
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Fig. 7.16 


If M is the total mass of the particles, the position of the center of mass G is given 
by the following equations: 


1 12 11:55 
х= — Vx, = — Уту, 2-2 Ута 
М; М £3 М i= 
п 
where М = Ут; 


The product mass x distance is often referred to as the first moment. 

When the particles form a solid body, the above summations become integrals. 
If 5m is the mass of a typical particle in the body at distances x, y and z from the 
planes, then the center of mass of the body is given by 


between appropriate limits. 


nz = M = total mass of the body 


A plane figure of area ABCD may be considered as a thin lamina. Its center of mass 
is found by taking moments about the x- and y-axes (Fig. 7.17). 

Let it be a requirement to find the position of the center of mass, G, of the thin 
lamina ABCD of mass m per unit area shown in the figure. The small strip has 
a mass myóx. Hence, by the above equations, if X is the x-coordinate of the center 
of mass, G 


b b 
х= арту І eas (7.11a) 
[P ту dx A Ja 
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Fig. 7.17 


A is the total area = ЇР у dx. We note that m cancels out. 
If we now take moments about the x-axis, we have, for the y-coordinate y of the 
center of mass, G 


1 b 
Ӯ = al y?dx (independent of m) (7.11b) 
2A Ja 


(The moment of the small strip about the x-axis is y/2(myóx) = 1/2ту28х.) 
G in the case of an area or a volume is usually referred to as the centroid. 


Example Find the center of mass G of a thin strip AB bent into a circular arc, as 
shown in Fig. 7.18. The mass per unit length is m and the radius r. The arc subtends 
an angle 20 at the center O. 
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Taking axes as shown, it follows that the center of mass G lies along the x-axis. 
Consider a small element PP’ of length 85; its mass is més = mró0, using polar 
coordinates. 

The moment of PP’ about the y-axis is xmré6 = mr? cos060. 

Hence the position of G is 


[2g mr? cos6 dé Е 2mr? sind | rsin 


иж [85 mr a8 |. 2mrÜ | 0 


If the strip is bent into a semicircle, 0 = л /2 and 


2r 
л 


x= 


Example Determine the center of mass G of the solid cone shown in Fig. 7.12. 
The equation of the straight line is y = R xX. 
The mass of the thin slice obtained by rotating the element ôx about the x-axis 
is тлу28х, where m is the mass per unit volume. The total mass of the cone is 


b R? pb 
М = тл | y?dx =ma | x?dx 

0 b? Jo 

1 

= -mr R?°b 
3m 
The moment about the y-axis of the slice is 
R2 
хтлу бх = тл ax 8x 


Hence the total moment = (mz R2)/b? (7 хЗах -1/4тл8252 
The position of G, which lies along the x-axis, is given by 


р їтлК252 3 
Wu Hop m 
4mnR?b 4 


7.4.3 The Theorems of Pappus 


Pappus' First Theorem 


Let AB be an arc of length L measured between x — a and x — b (Fig. 7.19). 
When it revolves about the x-axis, it generates a surface of revolution whose area is 


S = 2л fey ds. 


212 7 Applications of Integration 


Fig. 7.19 


If y is the ordinate of the centroid (of the arc) G then 


b 
pL = | y ds 
a 


Multiplying both sides by 27 gives 
b 
S-2njL-2« | y ds (7.12) 
a 


This is known as Pappus' first theorem. It states that the area of a surface of 
revolution is equal to the product of the path travelled by the centroid (of the arc) 
and the length of the generating arc. 


Pappus’ Second Theorem 


The volume of the solid of revolution generated by the area bounded by the arc, the 
ordinates at x — a and x — b and the x-axis is given by 


b 
у= | y?dx 
a 


If we denote the area by A, the centroid (of area) by G' and its ordinate by y’, then 


2 1 f? 
yA-;] y?dx 
2 Ја 
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Multiplying both sides by 27r gives 
E b 
V=2ny'A=n | y?dx (7.13) 
a 


This is known as Pappus' second theorem. It states that the volume of a solid 
of revolution is equal to the product of the path travelled by the centroid and the 
generating area. 

These two theorems apply also to a closed curve, provided it does not cut the axis 
about which it is rotated. 


Example Calculate (a) the surface area and (b) the volume of a torus (cf. Fig. 7.15). 
O' is the centroid in this case. 


(a) By the first of Pappus' theorems for the surface area we have 
S —2nyL 


But L = 21 R, ӯ = h; hence S = 2zh x 2x R = 4л2 Rh 
(b) By the second of Pappus' theorems for the volume we have 


V = 2луА 


But A = z R2, y = h; hence V = 2z hx R? = 2n? R?h 


Pappus' theorems are also useful in obtaining the centroid of a curve or an area 
when we know the surface or volume generated. This is illustrated by the following 
example. 


Example Find the position of the centroid of one quarter of a circular area of ra- 
dius R. 
Rotating the quarter circle about the x-axis gives a hemisphere. If y is the posi- 
tion of the centroid, we find 
mR? 2 


21 —— = =n R? 
лу 1 37 


Непсе 


R 
ӯ = — (note that x = y by symmetry) 
л 


7.4.4 Moments of Inertia; Second Moment of Area 


Moments of inertia play an important role in the study of the motion of rigid bodies. 
Equally important is the concept of the second moment of area which arises, for 
example, in the study of beam bending, torsion of bars and in problems involving 
surfaces immersed in fluids. 
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Moments of Inertia 


Definition The moment of inertia J of a mass M at a distance / from 
a fixed axis is given by 


1= мІ 


If we have a system of n masses M; at distance /; from the fixed axis, then the total 
moment of inertia is 


n 
I= Y Mg? 
(--1 


i= 


When the number of masses is infinite, i.e. when they merge into one mass forming 
а rigid body, the summation becomes an integral. 

Figure 7.20 shows a rigid body with P a typical particle of mass ôm at distances 
x and y from a Cartesian set of axes. 


By definition, the moment of inertia of that particle about the x-axis is у28т and 
the moment of inertia of the whole body about that axis is 


Ix = | am limits are defined by area A . (7.14) 
A 
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Similarly, the moment of inertia about the y-axis is 


I= 1 х24т (7.15) 
А 


It is important to specify the axis by a subscript unless it is obvious from the nature 
of the problem. 


Example Obtain the moment of inertia of a thin disc of radius А, thickness h and 
density p about a diameter (Fig. 7.21). 


Fig. 7.21 


Consider a strip PQ of length 2x and of width dy, parallel to the x-axis and 
at a distance y from it. The moment of inertia of the strip about the x-axis is 
2xy?hpóy. 

Hence, for the whole disc 


R 
Ix =2ph | xy?dy 
-R 


The integral may be solved by substituting x = Rcos0,y = Rsin@ and conse- 
quently dy = Rcos0 40: 
mpR*h 

4 


л/2 
1, = зо“ | cos? 0 sin? 0 10 = 
0 


If we denote the mass of the disc, z R?hp, by M, then I% = 1/4 M R2. This result is 
obviously true for any diameter. 
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Example For an axis through O perpendicular to the disc (the z-axis) let us take 
a thin ring at a distance r from the axis and of width ór (Fig. 7.22). Its moment of 
inertia is, by definition, r25m = r?2z rhórp = 2nphr?ór. 


(B 


Fig. 7.22 


The total moment of inertia of the disc about the z-axis is 


R 1 1 
I; =2xph | r?dr = 2xphR^ = -M R? 

0 2 2 
This result is also valid for a long cylinder. Iz is often referred to as the polar 
moment of inertia. 


Example A flywheel is an element with many practical applications. It consists 
basically of a hollow thin disc. The hole is necessary so that the flywheel can be 
supported by a shaft. In Fig. 7.23 the cylinder (known as a boss) FHJG is to ensure 
a good support on the shaft. 

Calculate the moment of inertia of the flywheel about its central z-axis. It is made 
of steel whose density is 7 800kg/m?. The sketch on the right of the figure shows 
a cross section through the diameter AB. 

The moment of inertia 7, is the sum of the moments of inertia of the discs that 
make up the flywheel: 


I; = I (CDEI) + I(FHJG) — I(KLMN) = I; + I2 — I3 


The moment of inertia of the hole has to be subtracted. Each element is a thin 
disc whose moment of inertia about the z-axis is 1/2 M R? (previous example) or 
1/8 MD?, where D is the diameter of the element. 

Let us first calculate the masses: 
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D 
40 mm 
D; = 150 mm 
| D3 = 450 mm 
Fig. 7.23 
T n2 л 2 
Mi = 4 Dihip = т x 0.45? х 0.085 x 7800 = 43.42kg to 2 d.p. 


Similarly, we find M5 — 8.96kg, M3 — 0.98kg. 
The moment of inertia about the z-axis is 


1 
I; = 3 (43.42 x 0.457 + 8.96 x 0.157 — 0.98 x 0.042) = 1.124kg/m? 


Perpendicular and Parallel Axis Theorems 


Consider the particle of mass óm at P shown in Fig. 7.20. Its coordinates are x 
and y. The body is a thin plate in the x- y plane. 

The moments of inertia of the small element are y?óm about Ох, х28т about 
Oy and г28т about Oz. 

Since r? = x? + y?, multiplying through by ôm gives 


г28т = х28т + у28т 


[Pan = [Pam | m 


Hence Iz = Гу +1 (7.16) 


By integration, we find 


This is known as the perpendicular axis theorem. 
For example, we saw that the moment of inertia of the thin disc in Fig. 7.21 was 
1/4 MR? about a diameter. It follows from the perpendicular axis theorem that 
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1 1 
1,-21,-2х MR = 5 MR 


Now let us consider the body shown in Fig. 7.24. Its center of mass is at G, and 
there are two parallel axes, one through G and another AB at a fixed distance d. The 
moment of inertia about AB of a small strip of mass 6m is 


ám(x +d)? 


The moment of inertia of the whole body about AB is 
IaB = fta? dm = [Pans fÈ am+ f2xd dm 


= [am 4? fam+2d | хат 


Since f x dm = 0 by definition of the center of mass 
Iag = Ig Md? 


This is known as the parallel axis theorem (Steiner’s theorem). 


Radius of Gyration 


If M is the mass of the body and k a distance such that the moment of inertia of the 
body J is expressed by 
I= Mk? 


then К is known ав the radius of gyration about the axis. Physically it means that we 
regard the whole mass to be concentrated at a radius k. 


Example We saw earlier that the moment of inertia of a cylinder about a central 
axis was given by J = 1/2 M R?. If we write Г = M k?, it follows that the radius of 
gyration of the cylinder is K = R/ V2. 
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Example Calculate the radius of gyration of the flywheel analysed in a previous 
example on p. 216. 

Total mass М = 43.42 + 8.96 — 0.98 = 51.40kg. 

Moment of inertia J = 1.124kg/m?. 

Since J = MK?, solving for k gives 


Е С т РЭГ 
MON Эр я 


Example In practice we often need to calculate the moment of inertia of a body. Its 
value can be estimated from drawings, but an experimental verification of the value 
might be required. One way of achieving this is to suspend the body on an axis and 
allow it to oscillate like a pendulum. The time for a number of complete oscillations 
is observed and, by a simple calculation, the radius of gyration is obtained. 

Figure 7.25 shows a body, such as a connecting rod in an internal combustion 
engine, pivoted at O. The total mass is M and its center of mass is at G, which can 
both be obtained experimentally. 


Fig. 7.25 


The connecting rod is allowed to oscillate about the axis O through a small an- 
gle 0. The distance between О and С is 4. In this way we have a compound pendu- 
lum. It can be shown that the period of one oscillation is given by t = 27 4/ ko? /gd 
(g is the acceleration due to gravity and kg is the radius of gyration about an axis 
through О). If Ig is the moment of inertia about an axis through G parallel to the 
axis through O, then by the parallel axis theorem 


1д-14-М42 ог Mk = МК + ма? 
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Hence ko? = kg? + d?, and the required radius of gyration is 


_,/Psd р 
kg = ae l 


Hence the moment of inertia about G is M ЁС, 


Second Moment of Area 


When studying the deflection of loaded beams or the twist in a shaft subjected to 
a torque, we encounter the following expression 


г28А 


where 6A is an element of area in the cross section of the beam ог the shaft and r is 
a distance from some axis. 

This product is known as the second moment of area and is similar to the moment 
of inertia of a body. 

The second moment of area of a plane figure of finite size is 


І = ! г2 dA between appropriate limits (7.17) 


The perpendicular axis and parallel axis theorems are valid for second moments of 
area as can readily be verified. 


Example The rectangle plays an important role in beams. Let us calculate its sec- 
ond moment of area about various axes. 

Figure 7.26 shows a rectangle, of width B and depth D, and two axes: one 
through the centroid denoted NA (which stands for neutral axis where the stress 
in a beam would be zero) and another, XX, at one end. 
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221 


(a) To find Jna, consider the small strip of thickness бу. Its second moment of area 
about NA is 


Bóyy? or By?óy 
Hence, for the whole rectangle 


D/2 BD? 
Ina = B | y*dy = —— or Ina = Akya? 
-D/2 12 


where A = cross sectional area = BD, and kya = radius of gyration = D / v 12. 
By symmetry, it is easily verified that 
DB? 

12 


(b) To find /xx: the distance between the axis NA and XX is D/2; hence, by the 
parallel axis theorem, we have 


р\2 
Ixx = na +a(Z) 


INA, = 


1 D? 61 
= — BD? + BD — = - BD? 
12 г 4 3 
D 
and kxx = —= 
v3 
Example Figure 7.27 shows the cross section of a type of beam known as an 


I-section. Using the dimensions given on the sketch, calculate the second moment 
of area about an axis through its centroid. 


Part 1 


Part 3 


Dimensions in mm 


Fig. 7.27 
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The centroid lies half way up the section which has been indicated as NA. To 
calculate the second moment of area, Гуд, we can divide the area into three parts. 

Parts 1 and 3 are of dimensions B; = 110mm and D, = 15mm. By the parallel 
axis theorem and where D — total depth of the beam: 


2 
1 р D 
Iya, = Bi Di + В.р. (2 - x) 


Substituting numerical values gives 
INA, = 5 x 110 x 153 +110 x 15 x 82.5? = 11.26 х 10°mm4* 
Part 2 is of dimensions B2 = 15mm and D2 = 150mm. Thus 
Ina, = (58:03 - - x 15 x 150? = 4.22 х 10°mm* 


Hence, for the whole section 


Ina = 214 + Ina; = (2x 11.26 + 4.22) x 10° = 26.74 х 1085mm^ 


Center of Pressure 


If a body is immersed in a fluid, e.g. water, then the pressure per unit area of surface 
is not uniform over the body because the pressure is proportional to the depth. The 
point at which the total pressure may be assumed to act is known as the center of 
pressure. 

Consider the plane surface S immersed in a fluid of density р and making an 
angle 0 with the free surface QP, as shown in Fig. 7.28. 


Fig. 7.28 
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For a small area of length y and width ôx, the pressure on it is goh where Л is the 
vertical distance from the surface QP to the element. The force ôF on this element is 


ôF —gphyóx and h=xsin@ 


The total force on the surface is given by 
F=g / psin@xy dx between appropriate limits 
= gpsind | xy ах 


But | ху dx = first moment of area about PP’ = Ax, where A is ће area of ће 
surface. 

Hence F = gpAxsin@ = gpAh. 

To find the point where the resultant force F acts, we take moments about PP’. 
For the element we have 


gphyóxx = gpsinOyxdx 


Hence, if z is the position of the point C where the resultant force acts, then 
Fz= gpsind | yx? dx or gpAxsinéz = gpsind | ул? 4х 


Solving for 2 gives 
| fyx?dx 
0 AR 
Hence, the point C where the resultant force due to the pressure of the fluid acts is 
given by 


(7.18) 


7 Second moment of area about PP’ 


First moment of area about PP’ 
C is known as the center of pressure. 


Exercises 


7.1 Areas 
1. Calculate the area bounded by the positive branch of the parabola y? = 25x, 
the x-axis and the ordinates where x = 0 and x = 36. 


2. Calculate the area bounded by the positive branch of the curve 
у2=(7—х)(5 + x), the x-axis and the ordinates where x = —5 and x = 1. 


3. Calculate the area bounded by the parabola 20y = 3(2x? — 3x — 5) and the 
x-axis between the points where the curve cuts the x-axis. 
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. Calculate the area bounded by the curve у2(х2 + 6x — 55) = 1, the x-axis 


and the ordinates where x = 7 and x = 14. 


. Sketch the curve y = 2x? — 15x? -- 24x +25 between x = 0 and x = 4 and 


then calculate the area enclosed by the ordinates at these points, the x-axis 
and the portion of the curve. 


. Calculate the area bounded by the hyperbola r?cos20 — 9 and the radial 


lines 0 = 0 and 0 = 30°. 


7. Calculate the entire area of the curve г = 3.5 $іп 20. 


8. Calculate the area bounded by the following curves: 


(a) y? = 4x and x? = бу 
(b y 24—x2,y 2 4— 4х 
(с) y = 64 4x — x? and the line joining the points (—2, —6) and (4, 6). 


7.2 Length of Curves 


9. 


Calculate the lengths of the curves given in exercises 1, 2 and 3. 


7.3 Surface Area and Volume of a Solid of Revolution 


10. 


11. 


12. 


13. 


Calculate the area of the surface generated by the revolution of the curve 
y — x? about the x-axis between the ordinates x — 0.5 and x — 0. 


The curve y = x(6— x) — 7.56 is rotated about the x-axis between the points 
where it crosses the x-axis. Calculate (a) the surface area and (b) the volume 
of the solid thus generated. 


Calculate (a) the surface area and (b) the volume generated by rotating the 
cycloid x = 0 — sin 0, у = 1 — cosÓ about the x-axis. 


Calculate the volume generated by revolving the ellipse x?/9-- у2/25 = 1 
about the x-axis. 


7.4 Applications to Mechanics 


14. 


15. 


16. 


17. 
18. 


Find the position of the centroid of the area of one quarter of an ellipse. The 


equation of the ellipse is 
2 2 


x y 

а2 p 
A plate is cut into a circular sector of 375 mm radius and 65? included angle. 
Find the position of the centroid along the axis of symmetry. 


! 


The density of the material of which a right circular cone is made varies as 
the square of the distance from the vertex. Find the position of the center 
of mass. 


A hemisphere has a radius of 125 mm. Calculate the position of its centroid. 


A cylindrical shell has a mass М, a radius А and a length L. Calculate its 
moment of inertia about 


(a) a central axis 
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(b) an axis about a diameter at one end 
(c) an axis through its centroid and along a diameter. 


19. A steel rod is 3.75 m long and of circular cross section of 35 mm diameter. 
The density of steel is 7 S00kg/m?. Calculate the moment of inertia about 
(a) the centroid and (b) one end. 

20. A solid right circular cone has a mass of 165 kg, a base radius of 175 mm 
and a height of 650 mm. Calculate its moment of inertia about a central axis. 


21. A beam has the cross section shown in Fig. 7.29. Calculate its second mo- 
ment of area about an axis through its centroid (NA) and the corresponding 
radius of gyration. The dimensions are in millimetres. 


Fig. 7.29 


22. Calculate the total pressure on the gate in the dam shown in Fig. 7.30 at 
a depth of 5 m. The gate is 2.5 m high and 1.5 m wide. Calculate also the 
position of the center of pressure. Density of water = 1 000kg/m?. 


Water level 


Fig. 7.30 
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23. A triangular plate of base 5 m and height 8 m is immersed in a lake with its 
base along the water level. Calculate the total pressure on the plate and the 
depth of the center of pressure if the plate is vertical. 

Density of water = 1 000kg/m? 


Chapter 8 
Taylor Series and Power Series 


8.1 Introduction 


In Chap. 5 we showed that the sum of a geometric series is given by 


bao eee 

1-х 
This formula holds true for —1 < x < 1. We will now consider this result from 
a different point of view. The left-hand side of the equation is an infinite series in 
powers of x, while the right-hand side is a simple function of x. Figure 8.1 shows 
the graph of this function. 


Fig. 8.1 


The series and the function are identical for a certain interval. It is also possible 
to represent other functions by means of series in ascending powers of x. 

In this chapter we investigate functions which can be expressed as infinite power 
series. An infinite power series is an expression of the form 


agp +a 4x +a2x* +азх? +- = ` ag x" 
n=0 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 
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Power series have an infinite number of terms, each term being a simple power of an 
independent variable such as x. We will try to express functions already discussed, 
like the trigonometric functions, as power series. Power series are easy to handle 
numerically. For small values of x, the higher terms of the series decrease rapidly. 
In this case an approximate value of a function can be obtained by taking the first 
terms of the series. 

The expansion of a function as a power series is useful for the following reasons: 


1. Evaluation. The functional values of exponential, trigonometric and logarithmic 
functions, for instance, can be computed numerically with the help of power 
series to a high degree of accuracy. 

2. Approximation. The first terms of a power series can be used to obtain an ap- 
proximate value for a given function. 

3. Term-by-term integration. It is not always possible to integrate a function as it 
stands. If, however, the function can be represented by an absolutely convergent 
power series it can then be integrated term by term to give the value of the 
integral to a high degree of accuracy. 


8.2 Expansion of a Function in a Power Series 
The following relationship holds true for many functions: 


CE ao +a1x +a2x? -d33^ ++ dax" oe (8.1) 
n=0 

The coefficients have to be evaluated for each function separately. One important 
property of such a series is that it is differentiable. It is a necessary condition but 
not a sufficient one. The function f(x) = e-/ X* for x # 0, for example, can be 
differentiated, but it cannot be expanded in a series. In Sect. 8.3 we will investigate 
the range of values of x for which the expansion is valid. For the time being we will 
assume that such an expansion is possible. 

Consider (8.1). The fundamental assumption is that the value of the function and 
the power series coincide at x = 0 and that the values of their derivatives coincide as 
well. This gives rise to an algorithm for evaluating the coefficients a; of the power 
series. 

Step 0: The function and the series must coincide at x — 0. This gives 


f (0) = ао +a, X O+42 x O4----F ag x 0+- 


Thus ao is known to be ao = f (0). 
Step 1: The first derivatives of the function and the series must coincide at x = 0. 
Obtain the first derivative: 


1 


f'(x) = a14-2aox + Зазх? +-+ nagx" 1 + 
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For x = 0 we get 
f'(0) =a, (ай other terms are zero) 


Thus a, is known to be a, = f"(0). 
Step 2: The second derivatives of the function and the series must coincide at x — 0. 
Obtain the second derivative: 


f" (x) = 202 +3 x2aax ----- n(n — 1jasx" 2 4... 
For x = 0 we get 
f"(0) = 2а5 (all other terms are zero) 


А lg" 
Thus a is known to be a? = 5 f” (0). 
Step n: The nth derivatives of the function and the series must coincide at x = 0. 
Obtain the nth derivative: 


f (х) = n(n-1)(n-2) x --x3x2x1xag4- (n4-1)n(n—10) x: xany1 X+ 
For x — 0 we get 

f")(0)2n(n—1)(n—2)x--x3x2x1xa, (all other terms are zero) 
Hence 


120) 2:0] 
(n—-1)n—2)-.x3x2x1 n! 


Note that n! = n(n —1)(n—2) x --x3x2x1 (т! іѕ read ‘n factorial’) 


f™(0) 


dn = 


I!2121 

21 —1x222 

3!=1x2x3=6 

44=1x2x3x4=24 and soon 
By definition, 01--1 


Definition The expansion of a function f(x) expressed in a power series 
is given by 


f(x) = 700) + OS Оа ah" Ox? 4e 


or, more simply 


f(x) = y Ў" (0) s (8.2) 
п=0 


This is known as Maclaurin’s series. 
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Expansion of the Exponential Function f (х) = ех 
The derivatives are 
Р(х) =е, fi" (x) ген (х) =e" 


Substituting in (8.2) gives 


Е Mo. OF x" 
co n 

ыу (8.3) 
п=0 °°" 


For апу given value of x, the factorial n! increases more rapidly than the power 
function x”; hence the terms get smaller as n increases. 
For x — 1, for example, we have 
1 1 1 1 


1 1 

1 

-1-1 Lus 
e F *t3*6'24'120 720 5040 


so that 
e zz 2.7182... 


Similarly, the expansion for e^ * is obtained by replacing x with —x: 


f(x) = sinx f (0) =0 
Р(х) = соѕх 3-410) d 
f" (x) = —sinx f"(0) =0 
f" (x) = —cosx f"(0) = —1 
Substituting in (8.2) gives 
NT NS 
snx = x 31 51 7| Ї 
"d S түп 1 2п+1 
= 2 О бага” ми 
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Expansion of the Binomial Series f (х) = (a + x)” 


f(x) = (а+х)" f (0) =a" 
f(x) 2 n(a-- x) f (0) = na" 1 
f" (x) 2 n(n— 1)(a x)? f" (0) = n(n - 1a"? 


f(x) =n(n—-1)---(n—k+1)(at+x)"* f*(0) =n(n—-1)---(n—k + 1)a* 


Note that n need not be an integer. Thus the expansion is valid for, e.g. n = 1/2. 
Substituting in (8.2) gives 


n(n—1):-(n—k- 1) 


-1 
(а х)" =a" tna ty 10 1 eg xa. m айк... 
A useful version of this series is when а = 1. We then һауе 
-1 -1)п-2 
ОЕА ea, (8.5) 


2! 3! 


Expansion of the Function f(x) = т 


We know the result already because this is the sum of a geometric series. 


ге) = 700) =1 
fosy 791 
го) к 77002 
о р 770)-3 


п! 


F'O) = Gynt f"(0) =n! 


Substituting in (8.2) gives the familiar result 


1 оо 
=S HxH БХ Бэ at pers Sox а) (8.6) 
ии п=0 


(Absolute value of x < 1) 
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8.3 Interval of Convergence of Power Series 


There are functions for which Maclaurin’s series converge for values of x within 

a certain range. This is the case for the geometric series. It is only convergent pro- 

vided that —1 < x < 1. This range is referred to as the interval of convergence. 
Consider the power series 


ao +a1x pasx^ tase? teagan” +an41x" sos 
The coefficients are numbers independent of x. The series may converge for certain 
values of x and diverge for other values. 
We wish to find the range of values of x for which the series converges, i.e. the 
interval of convergence. We form the ratio 
Gaya 
an x" 


assuming that all coefficients a; are non-zero. Now consider its limit: 


. ан-1 |x| 
lim x| = — 
n— o an R 
. a 
where А = lim Ё (8.7a) 
n9 |üpn41 


R is called the radius of convergence. 

The series is absolutely convergent if |x| « R and divergent if |x| > R. Hence 
a power series is convergent in a definite interval (— А, А) and divergent outside this 
interval. This is illustrated in Fig. 8.2. 


Divergent Convergent | Divergent 


-=R Ò +R 
Fig. 8.2 


Another well-known formula for computing R is due to Cauchy and Hadamard. 


It is 1 
R^ lim V/|a | (8.7b) 


n-—oo 


The formula is also applicable if some coefficients a; vanish, e.g. in the trigonomet- 
ric functions. 


Example Consider the power series 


2 3 4 n 
X^ OG X n-1* 
tap got 


ESSEN ae 


8.4 Approximate Values of Functions 233 
Using (8.7a) we obtain 


—(n+1)? 
23 


R= lim 


И оо 


= lim 


n—oo 


= 
Яп+1 


Hence the series is convergent if |х| < 1 and divergent if |x| > 1. 
Example Consider the exponential series 

oo = 

е = У 

п=0 597 


n 


Using (8.7a) we obtain 


R= lim 


И оо 


1)! 
= ува 


n-—oo n! И оо 


dn+1 


Hence the series is valid for all values of x; the radius of convergence is со, 


8.4 Approximate Values of Functions 


It is easier to handle a finite number of terms of a series than an infinite number. In 

a convergent series the terms tend to zero; the evaluation of a power series can there- 

fore be broken off after a certain number of terms. Where we break it off depends 

on the accuracy required. It is important then to be able to estimate the error. 
Consider the series 


f (x) = ag - a1x +agx? анх": 
Let us divide the series in two parts so that 


1 


f(x) = ао +aix+azx? ау Нара Mie 
Гал Бин su Uer ttes MEER ciat 
Approximate polynomial Remainder 


Pn (x) of degree n Rn(x) 


The first part represents the approximate value of the function f (x), and the second 
part the remainder. If we take the polynomial of degree п as an approximation of 
the value of the function f(x), it follows that the error is equal to R (x), the re- 
mainder. This remainder is an infinite series and if we can estimate its magnitude 
we automatically have an estimate for the value of the error. 

To appreciate the behaviour of approximations, let us consider graphically the 
sine function taking one term, then two terms, and so on, of the power series for 
sin x. 
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First approximation , sinx z x 
x? 
Second approximation , sin x & x — A 
: ЖЭ? ee ae 
Third approximation , sinx z x — 5 T 120 


The first approximation is represented by a tangent at the position x = 0 (Fig. 8.3a). 
We can see that the error builds up rapidly after x ~ 7/6. 

The second approximation replaces the sine function by a polynomial of the third 
degree (Fig. 8.3b). The range of values of x for which the approximation is suffi- 
ciently exact is greater. 

The third approximation replaces the sine function by a polynomial of the 5th de- 
gree (Fig. 8.3c). The range of values of x for which the approximation is satisfactory 
is much larger, e.g. consider an extreme case: 


sin Z = sin 90° ~ 1.00452, error = 0.00452 


Fig. 8.3 
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In practice, we would have to decide what error we could accept, and this depends 
very much on the nature of the problem. The above example does illustrate the point 
that a polynomial can give a good approximation to the value of some other function. 

Lagrange succeeded in estimating the error when the first terms of a series are 
used to calculate the value of a function. He showed that the terms neglected can be 
represented by the expression 


(n1) 
Rn(x) = FE est (8.8) 


This expression contains the (n + 1)th derivative of the function at some value of ё, 
which lies in the interval 0 < € < x. There exists a value of £ = & for which the 
remainder is a maximum. The error cannot be greater than Ry (£o). 


Example Suppose we stop the series for the exponential function after the third 
term. Then the remainder for x — 0.5, say, is 


g (0.5)* 


R3(0.5) =e 4-1 


As e* is monotonic increasing, we get the maximum value of the remainder with 
£o = 0.5: 


R3(0.5) = 


0.5 4 
057 ~ 0.004 


The error, in this case, will be less than 0.004 if we stop at the 3rd term of the 
expansion. 


8.5 Expansion of a Function f(x) at an Arbitrary Position 


It is often useful to expand a function at a position xo which is different from zero. 
To obtain such an expansion we could proceed as in Sect. 8.2, but instead we intro- 
duce a new variable, и = x — xo. Since this auxiliary variable is zero at x = Хо, we 
can expand the function at the position u = 0 in terms of ascending powers of u and 
afterwards express the expansion in terms of x. Hence we proceed as follows: 

Since the function is to be expanded at x = xo, we introduce a new variable, 
и-Х-Хо. 

We resolve in terms of х; х = и + хо and substitute the expression и + xo for 
xin f(x): 

f(x) = f(u+ x0). 


We expand at the position и = 0 with respect to и to obtain 


n 
X 
2! n! 


f (x) = f (uo 4- xo) = f (xo) + f'(xo)u + 
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Now we replace и by x — хо so that 


f(x) = f (xo) + f'(xo)(x — xo) + 2! (х= xo)? +: 
бо) хуулан (8.9) 


This type of series is known as Taylor’s series. 


Note that the geometric meaning of the substitution и = x — хо is a transforma- 
tion of coordinates; the variable и has its origin at x = хо. By means of this shift, we 
are back to the previous situation when we expanded a function at a position where 
the abscissa was zero. 


Example Expand the cosine / (х) = cosx about the point хо = л /3 or (60°). 
Differentiating gives 


f'(x)--sinx, f"(x)=—cosx, f”(x)= sinx 


"(л v3 n ( m (7 V3 
i= PIG) : f (3-5 


and so on. 
Substituting in (8.9) gives 


coreg -(2- 5) (8-5) at um 


Suppose we wish to calculate the value of cosine 61° without using tables. Then 


cos 61? = 


V3 л л УЗ m 
: 211) xm) ^ 200 


T 
If we use two terms only 


o 1 wee 
сов61° а 5 — 17 (150) = 0.5000 — 0.01511 0.48489 
cos (£ + 2415 л \ nti 
wer y= SEH PÓ 


Note: The ( + 1)th derivative of cos(x) is cos (x + (n + 1) /2л). 
Also, 1/3 < £ < 1 /3+ 1/180. 
In this case the error is 


< : Gy = 0.00015 
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since 


(The actual error is 0.00008.) 


Example Expand the function f (0) = (1 — asin? 0)!/2. 

This expression is important in the study of the slider crank mechanism as used 
in the car engine. 

To expand this function, we could use Maclaurin's or Taylor's series. However, 
a moment's thought leads us to conclude that we should use the binomial expansion 
instead. We saw earlier that 


(n — 1) n(n — 1)(п– 2) 
d 3! ge 


With x — —asin? 0 and n — 1/2 we have 


(+x) = ЦОГ Ус 


1(-1 1) (_1) (_3 
(1 asi)? = 1— Last + 2C 3 rato (C33) ? sin® 9... 
2 3 


1 1 1 
= 1 — 5asin^6 — za sint 8 — теа sinf 8 — «« 


8.6 Applications of Series 


At the beginning of this chapter we mentioned briefly the important applications of 
series. The numerical values of trigonometric, exponential, logarithmic and many 
more functions are computed by means of series. The values found in tables were 
first computed by hand a very long time ago: an extremely tiresome task! Today 
computers make the task far easier and furthermore, error is reduced. 


8.6.1 Polynomials as Approximations 


The expansion of functions as infinite series has a special significance in calculating 
approximate values. With a rapidly convergent series and with small values of x we 
only need to take the first two or three terms of the expansion; in some cases the 
first term is adequate. If we replace the function by an approximate polynomial, the 
mathematical expression may be simplified considerably. 


Example The atmospheric pressure р is a function of altitude h and is given by 


p= poe "^ 
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ро and а are constants, po being the pressure when h = 0. To calculate the pressure 
difference we have 


Ap — p — po — po (er -1) 


This expression can be simplified by using an approximation. 
Sincee * = 1—x+---, as a first approximation to the pressure difference, it 
follows that 
Ap & po(1 — o — 1) = — poo 


Suppose we want to calculate the altitude h when the pressure p is decreased by 1% 
of the pressure at h = 0, i.e. 


21 1 
Ap/po= Tog? = 0-121 x 10:23 


We have 
Ap 1 1 


Dox 100 ^ 0.121 x 103 
The error is 0.4m or 0.49% of the true value. 


h= = 82.64m 


Example Detour problem. Figure 8.4 shows two possible paths that can be taken 
when travelling a distance S from A to B, a direct one and an indirect one via C. 
The problem is to find how much longer is the detour via C than the direct path? 


C 


A B 


E ——— ————————— 5 ———————— 
Fig. 8.4 


Let u be the detour. If h is the height of an assumed equilateral triangle, then, by 
Pythagoras' theorem, we have 


2h\? 
= 5 1 — | -1 
u «(S 


To investigate the behaviour of и as a function of h, it is much simpler to express it 
by an approximate polynomial. Using the binomial expansion, we have 


gere (4(9)) iG) 008) 
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Provided that h < S, we can use a first-degree approximation by taking the first two 


terms of the series: j 
1 (2h 
x1l4+=(— 
fü) 2163 


Substituting in the equation for u gives 


1 /2h\? 2h2 
518) )- 9 


As an example, let S = 100km. The function is shown іп Fig. 8.5. An examination 
of the graph shows, e.g. that when h = 5km, the detour is only 0.5 km. 


(km) 


Fig. 8.5 


Example Obtain a closer approximation for one of the roots of the equation 
x* — 1.5x? +3.7х – 21.554 50 


A rough estimate gave x — 2.4. 
Let x be a rough approximation for the root of an equation found by trial and 
error. If the true solution is x +h, then, by Taylor's theorem, we have 


0— f(x+h) = f(x) +h: f'(x) 
Solving for h gives 


f(x) . PERD? 
hz —————.; hence x— ——- isa better approximation. 
f'(x) f'(x) 
This is also known as the Newton-Raphson approximation formula (see Chap. 17). 
Returning to the example, we find 


Р(х) = 4x3 —4.5x743.7 and /'(2.4) = 33.076 
Also f (2.4) = —0.2324 


It follows that h = 0.2324/33.076 = 0.007. 
A more accurate approximation is x = 2.4+ 0.007 = 2.407. 
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8.6.2 Integration of Functions when Expressed as Power Series 


We often encounter integrals whose integrands are complicated functions. This 
makes their integration extremely difficult or even near impossible. If the function 
to be integrated can be expressed as a power series, then we can integrate it term by 
term within the interval of convergence. In this way we can solve practical problems 
more easily. This is illustrated by the following examples. 


Example The function e is known as the Gaussian bell-shaped curve. It is sym- 


metrical about the y-axis. In statistics and the theory of errors the dispersion of mea- 
sured values about a mean is described in terms of a function of a similar type. We 
wish to compute the integral 
x 
2 
Í e dt 
0 


This corresponds to the area under the curve between г = 0 and t = x (Fig. 8.6). 


Fig. 8.6 


It is not possible to evaluate this integral as it stands; instead we replace it by 
a power series. Remember that 


2 x3 4 


SCIAT AT. 
Е 2 6 24 
Substituting —/? for x gives 
4 46 48 
-12 2 t t t 
-1-1 H 
2 UE ue UA N 


t 


Substituting fore ^ in the integral and integrating term by term gives 


7 9 


(еа = АЕ ИЗ 
Ü =X- 3 to R216 


As x — œ the integral has a limiting value. This value is given here without proof: 


88242 ул 
dt = Уг 
| j 2 
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Fig. 8.7 


The total area under the bell-shaped curve (Fig. 8.7) is then 


ge dt = x 


—oo 


It is useful to normalise the curve so that the area under it is equal to unity, i.e. 


ee 
| 8-4-1 
22577 
In the next example we will use the following statement which is given with- 
out proof. 


Multiplying two power series is valid within the interval of convergence if 
they are both absolutely convergent. 


Absolute convergence means that the sum of the absolute values of the summands 


converges as well. 
04 | 4_ x2 
| x dx 
0 4--4х3 


First we express the integrand as a product, i.e. 


Be en) nre 


The binomial series converges for |x| « 1. The condition is satisfied in the case of 
our two functions. The expansions are 


1/2 

x2 1 1 1 
px (5) E 2 4 6 
( 2 ) 8" 128° 104 


(+x?) ете 


Example Evaluate 
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Multiplying the two series gives, for the integrand 7 


1 1 1 1 
I=1 х? x? 44. x9 4 252 
8 2 128 16 1024 


Integrating term by term we find 


0.4 | ne ца 


І 
Idx = 5 x 
4 $ 3 3 "50" ORY таев 1]. 


0.4 
" Idx ғ 0.4 — 0.00267 — 0.00320 — 0.00002 + 0.00004 + 0.00009 
0 
& 0.3942 


8.6.3 Expansion in a Series by Integrating 


The expansion of a function in a series can sometimes be achieved by expanding its 
derivative first and then integrating term by term. 


Integrating a convergent power series term by term is valid. 


Example Obtain a series for (ап! x. 


We know that 
1 dx 


= J 1432 

If we expand the integrand in a series and integrate term by term we will obtain 
a series for (ап! x. 

Expanding the integrand by means of the binomial theorem gives 


tan 


1 
14-12 - (1-2) 3 Expose e IE Pei 
which is convergent for |x| « 1. 

Hence we have 


M tess раа (Ix <1) 
—— = tan = ... x 
o I4x2 REM AUC AMET eG = 
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Appendix: 


Commonly Used Approximate Polynomials 


This table contains a number of typical functions, together with the first terms when 
expanded as power series. These expansions can be used to obtain approximate 
values for the functions. The range of values of x for which the approximations 
are valid are given; these are based on the error being smaller than 1% and 10%, 
respectively. 


Table 8.1 Approximations for Typical Functions 


Function First approximation Second approximation 
Error less than Error less than 
1% 10% 1% 10% 
for x =0 for x =0 for x =0 for x =0 
tox = tox = tox = tox = 
x3 
sin x x 0.24 0.74 X= 31 1.00 1.66 
2 2 4 
х х х 
cosx L- EH 0.66 1.05 1— ata 1.18 1.44 
x3 
tanx x 0.17 0.53 xt 0.52 0.91 
x? 
e* 1+х 0.14 0.53 1+х+ F 0.43 1.10 
x2 
In(1+x) х 0.02 0.20 fons 0.17 0.58 
х»-1 
х x x? 
1-х 1+ 2 0.32 1.42 1+ NES 0.66 1.74 
|x| « 1 
1 x x 3 
-- 0.16 0.55 1---42x? 0.32 0.73 
V1+x 2 2 8 
|x| <1 
1 2 
= 1--х 0.10 0.31 1+x+x 0.21 0.46 
—x 
|x| « 1 
1 
Tro 1 0.31 056 14+x?4+x4 0.46 0.68 
-Х 
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Table 8.2 Power Series of Important Functions 


This table is included for further reference. It contains some expansions which have not been 
discussed in the text. In some cases negative powers occur. 


COS X 


tanx 


cot x 


sec х 


sinhx 


coshx 


tanhx 


cothx 


sinh! x 


sinh! x 


cosh“! x 


tanh7! x 


14 x 1 KP 1 x? 1 
"eds олауга раа 
x3 x? х7 
feet cm 
1 x? xí хб 
2! 41 6! 
25,22 17х7 | 62x? 
3 15 315 2835 ` 
1 x x? 2x5 х! 
x 3 45 945 4725 
q,X5,5x* | 61x? | 1385x9 
ar" ay 561^. 38b — 
n x? 3x? (3 x 5)x7 
2x3 2х4х5 2x4x6x7 
Z -sinx 
x ue. x? xtA. x? 
3 5 7 9 
—-—tan Їх 
x3 x> x7 
ede Kae TRADE T 
1 x? xí хб 
2! A4 6! 
p 0,2 Vm, 62x? 
3 15 315 2835 
1 Е: x? 2x? х! , 
х 3 45 945 4725 ` 
1xx? (1x3)x? (1хХ3х5)х7 
2х3 2x4x5 2х4х6х7 ` 
(In2)x 4 1 1x3 | 1x3x5 
(2х2)х2 (2x4x4)x* (2x4x6x6)x9 
1 1x3 1x3x5 
(In2)x 
(2х2)х2 (2x4x4)x* (2x4x6x6)x9 
"EA L x? L xt. 


(x| « 1) 
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Table 8.2 (continued) 


Ш 1 1 1 1 
coth ! x 25 3x3 | 5x5 47 Eu (|x| > 1) 
2 3 4 
In(l+x) =x 5 E = Еу (-1<x<1) 
n(n—1) п(п– 1)(п– 2 
(1+х)" =1+nx4 2 24 Ml 3 13 
n(n—1)---(n—k+1)x* 
| ) р Ee (x| « 1) 


This last formula is valid both for integral exponents and for fractional exponents. 


Note: There is no expansion for cot x and for coth x at x = 0 because these functions have a pole 
at this value. The series expansions were obtained by expanding x cot x and x coth x and dividing 
the result by x. 


Exercises 


8.2 Expansion of a Function in a Power Series 


1. Expand the following functions at xo = 0 in a series up to the first four terms: 


(a) f(x) 2 V1—x (b) f (t) =sin(wt+ л) 
(c) f(x) —In[(14-x)?] (d) f(x) = соѕх 
(e) f(x) =tanx (f) f(x) = coshx 


8.3 Interval of Convergence of a Power Series 


2. Obtain the radius of convergence of the following series: 


(a) f(x) =sinx = b wee 


оо 


1 
(b) f(x) = = X3 


| .1-3x 450 


8.4 Approximate Value of Functions 
3. Sketch in the neighbourhood of xo = 0 the function f (x) and the graphs of 
the approximate polynomials P4 (x), Р(х) and Рз(х). 


Е ur 
(а) y —tanx (b) = EET 
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8.5 Expansion of a Function at an Arbitrary Position 


4. 


5. 


6. 


Expand the following functions at xo = л: 


(a) y=sinx (b) y =cosx 
Expand the function f (x) = Inx at xo = 1. 
4 
Expand the function f (х) = ——— at xo = 2. Obtain the first four terms. 


1-3х 


8.6 Applications of Series 


T: 


10. 


11. 


12. 


13. 


Determine the intersection — which lies in the first quadrant — of the func- 
tions e* — 1 and 2sin x. Approximate both functions by a polynomial of the 
third degree, P3(x). 


. Calculate у 42 = 4/36 + 6to 4 d.p. 


. Replace the function f(x) by an approximate polynomial in the interval 


(0, 0.3). The error should not exceed 146. 


(a) f(x) =In(1 +x) 0) f() = 4 — 


Given the functions f(x), compute approximately (see Table 8.1) the value 
of f (1/4). The value obtained should have an accuracy of 10%. 


(a) f(x) =e* (5) f(x)ein(1--x) (o f(x)=vI+x 
Let the series for the function f(x) = 5 anx” be given. Obtain a series 


n= 
expansion for the integral f f(x) dx by integrating the series term by term 
for the following functions: 


1 со 
(a) f(x)= Ёс Y (-1/х5-1-х--32-х2--54--- (|х| <1) 
-0 
(geometric series) 
co 2n 2 4 6 
m 2 n* юР x 2 2 Бал 
THOR Os KS pag а age epe 


Solve the following integrals using a series expansion: 
0.58 
(a) | У1--х24х 
0 
* sint 
(b) ГА ЕР. dt (Integral (b) cannot be evaluated by any other method.) 
0 


(a) Obtain a power series for sin~! x by first expanding 1 / V1 — x?, which 
is the derivative of sin"! x, and, second, integrating term by term. 

(b) Since sin! (1) = 1/2, by inserting x = 1 into the series one obtains 
a series for л /2. Compute the value of this series up to the fifth term 
and compare with the correct numerical value. 


Chapter 9 
Complex Numbers 


9.1 Definition and Properties of Complex Numbers 


9.1.1 Imaginary Numbers 


The square of positive as well as negative real numbers is always a positive real 
number. For example, 32 = (—3)? = 9. The root of a positive number is therefore 
a positive or negative number. We now introduce a new type of number whose square 
always gives a negative real number: they are called imaginary numbers. 


Definition The unit of imaginary numbers is the number j with the property 
that 
p2-1 (9.1) 


The imaginary unit j corresponds to 1 for real numbers. 


An arbitrary imaginary number is made up of the imaginary unit j and any real 
number y; thus yj is the general form for an imaginary number. 

We know that we cannot extract the root of a negative number when dealing with 
real numbers; nevertheless, we can factorise the root of a negative number thus: 


V —5 =  /5(-1) = V5x 4-1 


Since j? = — 1, it follows that j = V~T; hence V5 x /—1 =jvV5. 
The root of a negative number is an imaginary number. Moreover, with i = –1, 
we can simplify higher powers of ј. 


К. Weltner, №. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 


248 9 Complex Numbers 


The ordinary rules of algebra extend to imaginary numbers. In addition one must 
remember that j = —1, ie. 


j'=j 
jf=-l 
p=? xj=-j 
jaf хр = 


9.1.2 Complex Numbers 


The sum z of a real number x and an imaginary number jy is called a complex 
number, complex meaning ‘composed’. Thus 


2=х+ју 


where x is the real part of z апа у is ће imaginary part of 2. 
If we replace j by —j, we obtain a different complex number z* given by 


zt =x—iy 


z* is called the complex conjugate of z. 
A complex number has the value zero only if both the real part and the imaginary 
part are zero. 


9.1.3 Fields of Application 


The most obvious property of imaginary numbers is that we can ‘extract’ the root 
of a negative number, which means we obtain an expression which we can handle. 
This property enables us, in principle, to solve equations of any degree. Consider, 
for instance, the quadratic equation 


ах -фх-х-0 


We have seen that it is unsolvable in terms of real x if b? < 4ac, i.e. when the 
radicand b? — 4ac becomes negative. The solution in terms of complex x is 


х12 = x (—b tiv/4ac - 2) 
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Complex numbers are important in the solution of differential equations which are 
dealt with in Chap. 10. They are also a useful concept in electrical engineering, and 
they are indispensable in the study of quantum physics. 


9.1.4 Operations with Complex Numbers 


When handling complex numbers there are two rules to remember: 
The complex number x + jy is zero if and only if x = 0 and y = 0. 
Complex numbers obey the ordinary rules of algebra, in addition Д? = –1. 


Addition and Subtraction of Complex Numbers 


Rule The sum of complex numbers is obtained by adding real and 
imaginary parts separately. 
The difference of complex numbers is obtained by subtracting real 
and imaginary parts separately. 


Let Example 
zy —-xitjyi zj —6-4 Tj 
22 = X2 4-1У2 Z2=3+ 4j 
Then their sum is 
2122 = (х1 +jy1) + (x2 +jy2) zi +22 = (6+7j) + (3 + 4j) 
= (x1 x2) +ј(у1 + y2) =9+11j 
Their difference is Example 
zj = 22 = (x1 — x2) + 1 У2)) 21—22—34 3j 


Product of Complex Numbers 


Rule The product 21,22 of two complex numbers is obtained by simple 


multiplication of the terms, taking into account j? — — 1. 
General expression Example 
2122 = (хі +ју1)(х2 +jy2) 2122 = (6+ 7j)(3 + 4j) 
= x1 x2 +jx1 y2 +jy1x2 ti? y1y2 =18+24j+21j— 28 


= (x1 x2 — y1 y2) +j(x1 y2 + x271) =—10+45j 
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Division of Complex Numbers 


Rule Division of a complex number by another complex number is car- 
ried out by multiplying numerator and denominator by the conjugate 
of the divisor to transform the latter into a real number. Note that the 
denominator then appears as the sum of two squares. 


General expression Example 
z xctja _ (хі +jy1)(*2 —jy2) 1) 6-7) (6+7)(3— 4j) 
22 Х241Уу2 (x2+jy2)(x2 —jy2) 22 3+4) (3-4)83-4) 
_ Qaxa + yiy2) - joya — y1x2) _z10-3_ -2 3. 
x32 F y? 25 5. 5l 


The conjugate of 2218 22 = x2 — jy2 


9.2 Graphical Representation of Complex Numbers 
9.2.1 Gauss Complex Number Plane: Argand Diagram 


The complex number z — x 4- jy can be represented in an x — y coordinate system 
by placing the real part along the x-axis and the imaginary part along the y-axis in 
a similar way to the components of a vector. Figure 9.1 shows this. 

We obtain a point P(z) in this plane which corresponds to the complex number z. 
This plane is called the Gauss number plane, better known as the Argand diagram. 
In this way we have produced a geometrical picture of a complex number. 


Example Where is the point in the Argand diagram corresponding to the complex 
number z — 4 — 2j? Figure 9.2 shows the answer. 


P(z) = (x,y) 


Fig. 9.1 
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Addition 
23 = 21 Z2 


If zı = 6+ 3j and z2 = 2 + 5j we know that the sum 23 = 8 + 8j. This addition may 
be represented in an Argand diagram, as shown in Fig. 9.3. 

To obtain the complex number 23, we first draw the two complex numbers 21 
and 22 as vectors as shown. Then we shift vector 22 parallel to itself in such a way 
that its tail is made to coincide with the tip of vector 21. The tip of vector 22 locates 
the tip of the required vector or complex number z3 whose magnitude is given by 
the length OP. This construction is based on the parallelogram rule for the addition 
of vectors. The same result is obtained if we shift 21 instead of 22. We dealt with 
vectors in Chap. 1, so this construction should not be new to the reader. 

To add more than two complex numbers we draw a polygon by joining the vectors 
tip to tail. The tip of the closing vector represents the required complex number. 


7 
6 

5 

4 

3 

2 

1 

0 2 4 6 8 х 
Fig. 9.3 

Subtraction 


Z3 = Z1 — Z2 = Z1 + (—22) 


The problem of the subtraction of two complex numbers can be transformed into 
one of addition if to one complex number we add the negative of the other. 


9.2.2 Polar Form of a Complex Number 


Instead of specifying a complex number by means of the coordinates x and y, we 
could specify it by means of a distance r from the origin of the coordinates and an 
angle о, as shown in Fig. 9.4 x and y are the Cartesian coordinates, and r and o are 
the polar coordinates. 
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Fig. 9.4 


From the figure we see that 
x =rcosa 
y=rsina 


Substituting in the expression for the complex number, z = x +jy, gives 
z =r(cosa +jsina) (9.2) 


This expresses a complex number in terms of the trigonometric functions. It fol- 
lows that the conjugate z* of the complex number z is 


z* =r(cosa —jsina) 


If we know r and o we can calculate x and y from the above equations. If on the 
other hand we know x and y and we want to express a complex number in polar 
form, it follows from Fig. 9.4 that 


r = /x2-- y? taking the positive value only 


a = tan! (2) 
x 


г is known as the modulus and а as the argument. 


Table 9.1 illustrates the values taken by о according to the sign of x and y. 


Table 9.1 
x y tano P(z) lies in æ in the range 
Эр zt Зе л 

роѕійуе positive positive 1st quadrant 0 to 2 

) $i : л 
negative positive negative 2nd quadrant - юл 

1 : ae 3л 
negative negative positive 3rd quadrant 7 to > 

Зл 


positive negative negative 4th quadrant 5 to 2л 
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Example Express the complex number z = 1 — j in polar form. 


z = l —j means that x = 1 and y= – 1. 
Hence r=Vx2+y2=V1i4F1=Vv2 
-1 
а —tan l| — zm а га ог AE 
1 4 4 


We represent the complex number in the Argand diagram (Fig. 9.5) and find that it 
lies in the fourth quadrant. It follows then that 


7л 
4 
Thus the complex number 2 = 1 —j can be written in polar form: 


z= Vi (cos E ээн) 


[04 


Example Express ће complex number 2 = 6j in polar form. 
In this case, z = x 4- jy = 0 + 6j, i.e. x = 0. Consequently r = 6 
араа i.e ga or zu 
Br ar суха 2 2 
To determine о, the complex number is shown in Fig. 9.6. It follows, therefore, that 
a = 5. In polar form the complex number is 


€ 
2 = cos — sin — 
3 news 
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9.3 Exponential Form of Complex Numbers 


9.3.1 Euler’s Formula 


It is also possible to express a complex number in the form 
z = rel? (9.3) 
As before, г is the modulus of z and о is the argument. We can show that it is 
equivalent to the polar form of a complex number: 
z=r(cosa+jsina) 
In other words we want to prove that 
rei” = r(cosa+jsina) 


Dividing by r gives 
el” = cosa +jsina 
In Chap. 8, Sect. 8.2, we showed that the expansion for e* is 


2 x3 4 


x24 x x 
е = A y gg T 
This expansion remains valid if we replace x by јо. Remembering that j? = — 1, we 
obtain 
enun o? a? at af 
da x cx ua тИ 
We also showed that 
: a? ae 
sino = a — ЗГ + 5 
Hence 
g^ us 
jsina = ја ET TJ 51 
a? at 
pose l- epic 
Comparing these expressions, we obtain 
РО a? aor a’ udi 
cosg +jsina = 1 4 jo 21 251 + 41 cJ ET “=e 
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Euler’s formula 


е! = cosa +j sina (9.4) 


This enables us to express a complex number in a third way. 
A table giving the various forms in which complex numbers can be expressed 
will be found in the appendix at the end of this chapter. 


9.3.2 Exponential Form of the Sine and Cosine Functions 


The conjugate complex number of е! is obtained by replacing | by —j, so that if 
z =e! then z* = е-е. 
Now we will try to express the sine and cosine function in terms of еі“. 
According to Euler’s formula, we have 


e!* = cosa +j sina 
e J” = cosa —j sina 
Adding both equations gives 


cosa = = (eJ* +e”) = coshja 


Nile 


Subtracting both equations gives 


l,; А 1 
sina = —(e!* —e 7*) = = sinhja 
2j j 


9.3.3 Complex Numbers as Powers 


Given the complex number z = x +jy, we wish to calculate the modulus and argu- 


ment of 


w = е? 


Substituting for 2, we have 
ш =el* HY) = eel? = rel 


Comparing the last two expressions, it follows that r = e* and ју = ja. Hence the 
modulus of w is e*, and the argument of w is y. 


Example If z = 2 +јл/2, calculate ш = е2. 


w=e= e (2732/2) = e2e17/2 
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Hence the modulus is r = e? and the argument is о = л /2, and we have 


5 Told Z) = је? 
ш =е (cos T. +-j sin 5 je 


The solution is shown in the Argand diagram (Fig. 9.7). 


Fig. 9.7 


Example If z = —2+j37/4, calculate w = е2. 


3m E 
w= C25 4 ) — ee 
Зл 
-2 
= а = — 
г = е and а= 1 
3 3 
Hence w=e? (cos = +j sin E 


MIL 


This solution is shown in the Argand diagram (Fig. 9.8). 


Fig. 9.8 


E) 
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Suppose now that z is a function of some parameter f. The simplest case is that 
of the linear function where a and b are constants. 


z(t) = at +jbt 


One important interpretation of f in practice is the time, so that z(t) grows with 
time, i.e. the real part and the imaginary part grow with time. 

Substituting for z in w = e? gives w(t) = e(4! +i!) — eat gib! 

Using Euler's formula, we get w(t) = е (cos bt +jsinbt) 

To examine the behaviour of this function, we can consider the real and the imag- 
inary parts separately and represent each one graphically as a function of the time t. 

The real part of w(t) is е cosbt. It is the product of an exponential function 
and a trigonometric function of period p = 2л /b. 

If a is positive then w — e^' cosbt represents a vibration whose amplitude grows 
exponentially with time, as shown in Fig. 9.9. 


Fig. 9.9 


If a is negative then w = e^! cosbt represents a vibration whose amplitude de- 
creases exponentially with time; the vibration is said to be damped and is shown 
below in Fig. 9.10. 


Fig. 9.10 
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The imaginary part of w(t) 18 e” sin bt. It is also the product of an exponential 
function and a trigonometric function whose graphical representation is similar to 
Fig. 9.9 or Fig. 9.10, depending on the sign of a. 

The mathematical solution of vibration problems is often simplified by means 
of complex numbers. Thus, in solving a practical problem, we start by considering 
the physical situation consisting of real quantities. We then perform all calculations 
using complex numbers and finally consider and interpret the results of the real and 
imaginary parts. 


9.3.4 Multiplication and Division in Exponential Form 


Addition and subtraction of complex numbers are best carried out using the form 

Zz —x-4 jy. Multiplication and division, on the other hand are best carried out by 

expressing the complex numbers either in exponential form or in polar form. 
Consider two complex numbers 


zı —rje*! and z2 = rze®2 


Multiplying gives 
z = 2129 = rye! r9el2 = p ro 1 t02) (9.5) 
Here we use the power rule а"а”! = a"?™, 
Dividing gives 
22 Zi reli = rı йо 90) (9.6) 


Z9 mej% гә 


Here we use the power rule a" /а”! = a” 7". 


Rule To multiply (or divide) complex numbers, we multiply (or divide) 
the moduli and add (or subtract) the arguments. 
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9.3.5 Raising to a Power, Exponential Form 


We have 
z” = (re? = ре)" (9.7) 


Rule То raise a complex number to a given power we raise the modulus 
to that power and multiply the argument by that power. 


Figure 9.11 shows the points z and z? in the complex plane with r — 2. 


Fig. 9.11 


1/” we have 


21! = ут = Vreit = үед!" (9.8) 


In the case of z 


Rule To extract the root of a complex number, we find the root of the 
modulus and divide the argument by the index. 


9.3.6 Periodicity of rei 


We should like to mention the fact, perhaps surprising to the reader, that the complex 
number 


2 = ге) 


is identical to z = rel ^27) 
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Fig. 9.12 


If we examine Fig. 9.12 we can see that the same point P(z) is obtained whether 
the angle is o ого + 27. In fact we could equally take the angle to be o + 4л, о + 
67 ,a — 21,0 — 4л, etc. Hence, generally 


геј = rele 2zk) 
where k =+1,+2,+3 etc. 
In particular we have 1 = ef?” 


9.3.7 Transformation of a Complex Number From One Form 
into Another 


Transformation from the Algebraic Form into the Exponential Form 


The transformation from x + ју into re!” is based on the relationships derived in 
Sect. 9.2.2, Le. 


r=V/x2+y2, tana= л 
х 
Example Convert the complex number z = — ү/5 + 2j to the exponential form. 
r = \/(-V5)?4+22 =3 
t : 0.894 
ana = —- = 0). 
-v5 


The angle is in the second quadrant; therefore о = 138.19? or 0.7687 radians. 
Transformation of the Exponential Form into the Algebraic Form 


Since z = ге} =r(cosa+j sina) = x+jy 
then x =rcosa 


y=rsina 
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0.54-1.3j) 


Example Convert the expression z — el to the algebraic form. 


0.5-1.3) _ 20.51.3) 


2 = е 
х = e-5) cos 1.3 = 0.441 


y =e) sin 1.3 = 1.589 


Hence z = 0.441 + 1.589) 


9.4 Operations with Complex Numbers Expressed іп Polar Form 


In the previous section, the rules for operations with complex numbers have been de- 
rived using the exponential form. We will now show that the same rules are obtained 
if we express the complex numbers in polar form. We make use of the addition the- 
orems for trigonometric functions given in Chap. 3, Sect. 3.6.6. 


9.4.1 Multiplication and Division 


Let zı and 22 be expressed in polar form so that 


Z1 = ri(coso +j sing) 


Z2 = r2(cos œz +j sino) 


Multiplication 


Z1Z2 = rir2(coso +j sina, )(cosæ2 +j sina) 


= rir2[(coso1 cos@2 — sino sino) + j(sino cosa + cosa sina2)| 


Using the addition formulae for the sine and cosine functions, 


Z1Z2 = rira2|cos(o + a2) +j sin(a; + o2)] 
= r(cosa 4- j sina) (9.9) 
Thus the modulus r of the product equals / 172, and the argument o is (от -- 02). 


This is exactly in accordance with the rule derived in the previous section: to 
multiply complex numbers we multiply the moduli and add the arguments. 
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Figure 9.13 illustrates geometrically the multiplication of complex numbers. 
Draw a triangle OPP, similar to the triangle OP2Q (О has coordinates 1, 0), then 


2 
— = —, hence 2 = 2122 


Also o = angle ООР = o + a2 


Yip Р(2 = 2122) 


Fig. 9.13 


Division 


Both the numerator and the denominator are multiplied by the conjugate of the 


divisor. Ре " 
zı ri(coso +-јѕіпат) го (соѕо2 —jsino) 


Z2  r3(cosao 4 jsino) r2(cosa2 —jsina2) 
Using the addition formulae for the sine and cosine functions, 
21 


a [cos(a@1 — 0/2) +jsin(a1 —a2)| (9.10) 
2 Р 


Thus the modulus of the quotient equals нь and the argument is (01-02). 

Again this is in accordance with the rule derived in the previous section: to divide 
complex numbers we divide the moduli and subtract the arguments. 

Figure 9.14 illustrates the division of complex numbers by a reasoning similar to 
that for multiplication. 


Fig. 9.14 
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9.4.2 Raising to a Power 


We saw earlier that z1z2 = rir2[cos(a1 -- 0/2) +j sin(o1 + o2)] 
If we now let z1 = 22 = z,r1 = r2 =r anda, = a2 = a it follows that 


22 = r? (cos 2a +j sin2a) 
Similarly 23 =r?(cos2a+jsin2a)r (cosa +jsina) 
= r3(cos3a +j sin3a) 


The general expression is 


z” <=r"(cosna +j sinna) (9.11) 


Rule To raise a complex number to a given power, we raise the modulus 
to that power and multiply the argument by that power. 
By setting r = 1 we have 
(cosa +jsina)” = соѕпо + jsinno 


This is known as De Moivre’s theorem. 


9.4.3 Roots of a Complex Number 


De Moivre’s theorem holds true for positive, negative and fractional powers. We 
can, therefore, use this fact to determine all the distinct roots of any number. 
Since x 4-jy = r (cos +j sing), then, by De Moivre's theorem, it follows that 


Мх +ју = Vr (cos Е +jsin =) 
n n 
However, using this equation, we obtain one root only. In order to obtain all the roots 
we must consider the fact that the cosine and sine functions are periodic functions 
of period 2л radians ог 360°. Thus we can write 
(cosa +j sina)” = [cos(a + 27k) +j sin (a + 27k)|" 
= cos(na + 2z nk) 4- jsin(na + 2z nk) 
where k = 0,+1,+2,+3,--- 


, 


When raising a complex number to an integral power there is no ambiguity: the 
result is independent of periodicity. But extracting the roots of a complex number 
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means raising it to a fractional power. Now periodicity becomes important, and 
we have 


2 2 
VxTjy- Vr [cos (1623) +jsin (12-28) 
n n n n 


where k = 0,+1,+2,--- (9.12) 


Rule The nth roots of a complex number are obtained by extracting the 
nth root of the modulus and dividing the argument by n. Due to the 
periodicity of the trigonometric functions, there are n solutions. 


By giving k the values 0,1,2,3,--- ,(m — 1), we obtain the n different roots of 
a complex number; for example, 


k=0, д = Ur (cos = + jsin=) 
n n 


2 2 

К-1, a= Ye es (2+2) +jsin(S +) 
n n n n 
4 4 

k=2 sza = Ye eos ($+) +jsin(S 4) 
n n n n 


and so on. With k =n, we would obtain the same value as with k = 0; also we would 
not obtain any new values for k > n or = —1,—2,—3,---. The root corresponding 
to k = 0 is called the principal value. 


Example Calculate the four roots of 


z^ = cos 23 +jsin 23 
Su E M 


In this case r = 1 and n = 4; hence 


CAES NN 
2 = cos (Z+ Zk) isin (Z+ 23 


= cos (z^ Zk) +jsin (z- Zk) 


The roots are 


1 1 
k=0, д = cos + јап = 2 V3 7j 


6 
4л .. 1 v3. 
k=l; 22 = cos == +jsin -= = 265 
k=2, 23 = соѕ ne ee УЗ 1. 
Si c c A d AE. 
10x 10x 1 V3 
k= = NA eM e iue, Н 
3, z4-cos 6 +jsin 6 5 21 
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The roots z; and 2з, as well as 22 and 24, are opposite to each other; moreover, we 
can see that the argument, starting from the principal value, is successively increased 
by x /2. Figure 9.15 shows all the values on a circle of radius 1. They form a square. 


Fig. 9.15 


All the nth roots of a complex number of modulus 1 have modulus 1. When 
depicted in an Argand diagram they form the vertices of a regular n-sided 
polygon inscribed in a unit circle. 
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Appendix 


Summary of operations with complex numbers 


Designation Formulae 
Imaginary unit j j?^2-1 
Imaginary number 7 п — jy (y real) 
Complex number z in arithmetic form Z=x+jy (x,y real) 

х = real part 

y = imaginary part 
Complex conjugate z'-x-jy 
Complex numbers in polar form Z=r(cosa+j sing) 

= a ees, 

Transformation (x, y) = (r,@) T d дах } 

y=rsina tana = y/x 
Complex number in exponential form 2- rei 
Euler’s formula el = cosa +j sina 

Dr. : 

Exponential form for cosine cosa = 5 (е0 +eJ%) = cosh ja 


and sine functions 


l5 А 1 
sina = —(e/* — е7) = — sinhja 
2) j 


z-rel* 


Periodicity of complex numbers NM (k =+1,+2,43,---) 


Multiplication and division 2р = rie, 25 = r3 el*2 


in exponential form : 
р 2122 = ry roel +9) 


ae FI gio 92) 


Z2 12 
Raising to a power and extracting z —rel* 
roots in exponential form git gne 


Wz = Vrell(o*274)/n] (k — 0, +1, +2,---) 


Multiplication and division 21 =r (cosa, —jsino) 
in polar form 


Z5 = r2 (cosa +j sina) 


2129 = rir2|cos(o1 4 6/2) - jsin(o +@2)] 


21 ГІ vis 
[cos(o4 — &2) +j ѕіп(о — «2)] 
zx. 
Raising to a power and extracting z =r(cosæ +ј ѕіпа) 
roots in polar form z” = r” [соѕпа +j sinna] 


cope 


Exercises 


Exercises 


9.1 Definition and Properties of Complex Numbers 


1. Express the following in terms of j: 


(а) V4—7 (b) ү/-144 
(c) x (4) /4(-25) 
2. Compute 
(a) jë (b) 17 
(c) jf? (d) (-17 
3. Evaluate 
(a) /—48+/—75— /—27 (b) /—12— /—-8+ /—0.6 
(c) V -3V/.-3 (4) /-av-b 
(e) 5j?2* 6 c3"? 
(p 8j/2j ( 1/7? 
(i) 6j! v3 91g 4015" 
(k) Ур ауа Б " /-3у12 
jv—a? 
4. Determine the imaginary part of z: 
(a) 2= 3+7) 
(b) 2= 15] – 4 
5. Determine the conjugate complex number z* of 2: 
(a) 2=5 +2) 
©) 2= 5-3 


6. Evaluate ће (complex) roots of ће following quadratic equations: 


(а) x? +4x+13=0 


3 25 
b) x24 2 — = 
(b) x 3*8 0 
7. Calculate the sum 2] + 22: 
З 3 3, 
(a) z1 = 3— 2j (b) aco 
: 3 3, 
Z5 —14- 5j 2= 2—1] 
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8. Compute w = Z1 — 22 + 23°: 


(a) 2] = 5 – 2] (b) zı =4-— 3.5] 
22 =2 — 3) 22 =3 +2) 
Z3 = —4- 6j Z3 = 7.5) 
9. Compute ће product ш = 2122: 
(а) zy = 1+] (b) 21-3-2) 
Z2=1-j Z5 =5+4) 


10. Determine 


(а) (16+jv2)/2v2 (5) (4- j 3)/3j 
(с) (2+3))/(2—4)) (d) 1/(1+)) 
ey Need 134 (y EHIS —3У3) 
I-j 14j 2-1у3 
11. Convert the following sums into products: 
(a) 4x? + 9y? (b) a+b 


9.2 Graphical Representation of Complex Numbers 


12. Plot each point z; and —z;* in the complex number plane: 


(а) 21--1-1 (5) z2 234 2j (c) z3 — 54-3j 
3 
d) 24-11 18 
(Ота) (e) 25---34-2) (f) 26 = V2 
13. Using Fig. 9.16, determine the real and imaginary parts of each point 
21:22:77 ,26- 


Fig. 9.16 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Convert the complex number z = x + jy to the polar form 
z = r(cosa +j sina): 


(a) 2-1-1 (by z=-(1 +j) 


Transform the complex number z = r(cosæ +j sing) into the form z = 
X+jy: 


(a) z — 5 (соз T -jsin =) (b) z = 4(cos225° + j sin 225°) 
Compute 2122: 
(а) z1 = 2(соѕ 15° +jsin 15°) (b) z1 = V5(cos 80° +jsin80°) 
Z2 = 3(cos 45° + jsin 45°) Z2 = V5(cos 40° +jsin 40°) 
Calculate z4 /Z2: 
(а) д] =cos70° +jsin 70? (b) zy =4 
Z2 = cos25? +jsin 25° Z2 = 4(cos30° + j sin 30°) 


(Hint: 4 = 4(cos 360° +jsin360°)) 


What is meant geometrically by the multiplication (or division) of a complex 
number by —j? 
Calculate 


(а) (1-17 (b) 2128 i 
2 12 


(a) Prove that (cos 50° — j sin50°)* = соѕ200° — j sin 200°. 
(b) State De Moivre's theorem. 


Calculate all the roots of 


(a) у-5--12) (b) d/cos 60? +j sin 60° 


9.3 Exponential Form of Complex Numbers 


22. 


23. 


Using Euler's formula, compute coso and sino and convert to the alge- 
braic form: 


(a) ејт/2 (Ь) ејт/3 
Let the values for еі апіе-!° be given. Compute the values of o, coso and 
sina: 
(a) еј = 1 (b) ејб = –1 
еј = 1 e ie = -1 


ја: 1 ; 
(uk ope à) elt = 37 


А T. ee J 
-jæ — ,/3— L 
5 2 3-75 
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24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33; 
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Given the complex number z = x +jy, w = e? is then a new complex num- 
ber. Put it in the form ш = re!” and compute r and о if 


(а) z—342j () z-2-Z 
Transform the complex number ш = e? into the form ш = и + jv if 
(a) Тал (b) ЗЭ?” 
а) 2= = z—-— 
2 2, сав 
(©) z=-1-j> () z-3-j 


Let the complex quantity z be a linear function of the parameter / (for ex- 
ample time), i.e. z(t) = at +jbt(0 € t € eo). Given 
(a) z(t) = —t-- j2zt, 
3 
(i) what is the real part, Re[w(t)], of w(t) = e? 
(ii) what is the period of Re[w(t)]? 
Gii) what is the amplitude of the function w(t) at time t = 2? 


Compute the product 21 22: 
Я 1. 
(а) zy = 2еі7/2 (b) д) = 2pm 
l 3 . 
25 = 5677 Zo = sore 


Calculate for the pairs of numbers 2,22 in the previous exercise the quotient 
Z1 Е / 22. 


(а) Given 2 = ?2еіт/5, calculate 27. 
(b) Given z = seins, calculate 23. 
(a) Given z = 32e/!9, calculate 21/5, 
(0) Given z = 7 167. calculate 2174. 


Given z = re!”, what in each of the following cases is the angle œ for which 
0 <а < 2л? 


(а) z = 3eJ?7 а sellers 


Put the following complex numbers into exponential form: 
(a) 5— 5j (b) 15-13) 


(a) Put the expression z — 2.5e43^30' into the form x +jy. 
(b) Calculate e/149 с-182” and express the result in the form z = x 4- jy. 
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9.4 Operations with Complex Numbers Expressed in Polar Form 
(1-3)? 

v2 - j) 

(b) What is the polar form of this complex number? 


35. Put z = —2(cos30° — j sin 30°): 


34. (a) Determine the real and imaginary parts of 


(a) into the form x 4-jy, 
(b) into exponential form. 


Chapter 10 
Differential Equations 


10.1 Concept and Classification of Differential Equations 


Many natural laws in physics and engineering are formulated by equations involving 
derivatives or differentials of physical quantities. 
An example is Newton’s axiom in mechanics, which states 


Force = mass x acceleration 


The acceleration is the second derivative of the displacement x with respect to the 
time f. The law can be written as 


Е =mx(t) 


The force F may be constant or a function of the displacement, a function of the 
velocity v or a function of some other parameter of the system. We are interested in 
the displacement as a function of time, i.e. 


x — x(t) 


To find it, we must solve Newton's equation. 

An equation containing one or more derivatives is called a differential equation. 
In what follows we will use the term DE for short. 

Let us consider a concrete example. The motion of a body of mass m falling 
freely (see Fig. 10.1) is described by the DE 


mx = —mg 
or x = -g 


(Air resistance is neglected; g, the acceleration due to gravity, is 9.81m/s?.) We 
require the displacement x(t), which is determined by the DE given above. 


K. Weltner, W.J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 
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Fig. 10.1 


Later we will show how to solve such an equation systematically, but for the time 
being we will merely quote the result. 

The position of a body at an instant of time t which satisfies the DE X = —g is 
given by 

1 
x(t) = sat? +С +С 

Cı and C» are arbitrary constants. You can verify the correctness of this solution 
for yourself by differentiating it twice. Thus a DE serves to determine a required 
function, unlike an algebraic equation which determines numbers. 

In general, the DE of a function y(x) may contain one or more derivatives of that 


function, as well as the function itself and the independent variable x. 
Examples of DEs are 


y" 4 x2y! + y? e sinx = 0 
y"+x=0 
e*y'—-3x = 0 


Among the great number of possible types of DE encountered in physics and en- 
gineering the most important ones are the linear DEs of the first and of the second 
order, with constant coefficients. 

But first we must define the terms order of a DE and linear DE. 


Order of a DE 
Definition The order of a DE is defined by the highest derivative contained 
therein. Thus an nth order DE contains an nth derivative. 


Examples are 


у+ах = 0, which is of the first order 
and у! +7у = 0, whichis of the second order. 
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Linear DE 


Definition If the function y and its derivatives (y’,y”,...) in a DE are all 


to the first power and if no products like yy’, y' y" etc. occur, 


then the DE is linear. 


Examples are 


y'-7y--sinx =0 and 5y = ху which are linear DEs. 
y'--y?—0 and (у")2 = х2у which are non-linear DEs. 


Linear DE with Constant Coefficients 


Definition The DE 
a2 y" - ay' +аоу = f (x) 


where az Æ 0 and a5,a, and ао are arbitrary real constants, is 
called a second-order linear differential equation with constant 
coefficients since all a ; are constants. 


Given the linear DE with constant coefficients 
a2 y" -ату  aoy = f (x) 


We must distinguish between two cases: 


f(x)-0 and f(x) 40 


If f(x) = 0, then the DE is referred to as a homogeneous DE. If f (x) 4 0, then it 
is referred to as a non-homogeneous DE. 


A homogeneous DE is my" t yy' Ку = 0 
A non-homogeneous DE is my" +yy'+ky =sinwx 


If in the DE in the definition above, a2 = 0 and a, # 0, the equation becomes 
a1 y' +aoy = f (x), which is a first-order linear DE with constant coefficients. (a1 
and ao are real numbers.) 

The following equations are examples of first-order linear DEs: 


y'—gt-0 
y -xy-0 


Every function which satisfies a DE is called a solution of that DE. The purpose 
of this chapter is to deal with the problem of finding solutions of DEs. Before pro- 
ceeding further, let us consider the solution of the equation y" = —g. The following 
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equations are possible solutions of this DE, as can easily be verified by inserting in 
the DE: 


у = —zgx^-Cix-4 C3 


1 
ya = 58x? + C2 


1 
уз = -zgx* +Cix 
and y4 = —4gx 


The solutions Уг, уз and уд are obviously special cases of the solution у. They are 
obtained when constants are set to zero. We are allowed to give Сү and C2 any value 
we like, e.g. 

Сү--1, Co=5 


Hence the solution y = —1/2gx? — x + 5 is another solution of the DE y" = —g. 
Thus we have made it clear that y, is a solution of the DE, no matter what values we 
assume for C, and C2. This implies that the solution of the DE is not uniquely deter- 
mined. The constants which appear in the solution and which we can choose freely 
are called integration constant. The solution is referred to as the general solution 
before the constants are evaluated. 

The number of constants which appear in the solution of a DE is determined by 
the following lemma. 


Lemma 10.1 The general solution of a first-order DE contains exactly one 
undetermined integration constant. The general solution of a second-order 
DE contains exactly two integration constants, which can be chosen indepen- 
dently of each other. 


This statement follows from the fact that a first-order DE requires one integra- 
tion and hence one constant of integration, while a second-order DE requires two 
integrations, and hence two constants of integration. 

A special solution of the DE is obtained by assigning particular values to the 
constants in the general solution. The special solution is called a particular solution 
or a particular integral. 

In the example above, the second, third and fourth solutions are particular so- 
lutions of the general solution, i.e. of the first solution (С = 0,C5 = 0, and 
C, = Сә = 0, respectively). 

We are, above all, interested in the general solution, since it contains all the par- 
ticular solutions. A particular solution is obtained if additional conditions are im- 
posed. These conditions are referred to as boundary conditions. 

There is a similarity with the problem of integration. An indefinite integral is 
a general solution, while the definite integral is the particular solution when certain 
conditions are imposed, such as the limits of integration. 
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The constants in the general solution of a DE are chosen in such a way as to 
satisfy the boundary conditions. The problem in physics and engineering is that 
of obtaining a particular solution by fixing boundary conditions in order to solve 
a particular case. 

We will now develop methods for solving first- and second-order DEs with con- 
stant coefficients. 


10.2 Preliminary Remarks 
It has already been mentioned that a special case of the linear second-order DE is 
obtained by setting az = 0 in 
azy" t diy  aoy = f(x) 
The result is the linear first-order DE 
ayy’ +аоу = f (x) 


For this reason, we will derive the solution for the second-order DE and only refer 
briefly to its application to the first-order DE. The main reason for doing this is 
that in physics and engineering many of the problems we meet lead to second-order 
DEs. 

Finding a solution for the non-homogeneous second-order DE is made easier by 
the following lemma. 


Lemma 10.2 Consider the non-homogeneous DE 
а2у" t a1y' +аоу = f(x) 
Let ус be the general solution of the homogeneous equation 


a2 y" -- ai y! +аоу = 0 


yc is also called the complementary function. 
Let ур be a particular solution of the non-homogeneous DE 


азу! t a1y' +аоу = f(x) 
Then the general solution of the DE is given by 


y — Ус- yp (10.1) 
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Proof We will first show that y = yc + yp is a solution of the DE. 
According to the assumptions we have made for the homogeneous DE, 


азус! Бату +аоус = 0 [1] 


For the non-homogeneous DE we have 


42 p^ + аур +аоур = f(x) [2] 


Substituting y = yc + yp in the non-homogeneous equation gives 


а2(Ус- yp)" +41 (ус + yp) 4- ao(yc + yp) = f(x) 


Rearranging gives 


(a2 yc" + a1 ус +аоус) + (a2 yp" +41 yp’ t aoyp) = f (x) 


But the first bracket is zero, according to Eq. [1], and the second bracket is in accor- 
dance with Eq. [2]. It follows that y = ус + yp is a solution of the non-homogeneous 
DE. Furthermore, since we assumed that Ус is the general solution of the homoge- 
neous DE, it contains two arbitrary constants (cf. Lemma 10.1). Hence the solution 
y = ус + yp also contains two arbitrary constants which can be chosen indepen- 
dently of each other: it is the general solution. 

According to Lemma 10.2, the general, or complete, solution of 


a2y" t a1y' +aoy = f (x) 
can be achieved in three steps: 


Step 1: Find the complementary function ус of the homogeneous equation. 
Step 2: Find a particular integral yp of the non-homogeneous equation. 
Step 3: Add both solutions to obtain the general solution of the non-homogeneous 
equation: 
y = Ус yp 


To solve DEs, physicists and engineers will often look up solutions from a col- 
lection of solutions and will only try to find solutions for themselves when such 
a collection is not at hand. Even in these circumstances they will not necessarily 
follow a systematic procedure that is always successful; instead they will try to find 
a solution and then use the principle of verification to prove that it is valid. If the 
assumed solution is found not to be valid, it is modified and the process repeated 
until a valid solution is found. To guess successfully requires experience which the 
learner, obviously, does not possess; in the following section we will therefore con- 
sider systematic methods of solution. 
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10.3 General Solution of First- and Second-Order DEs 
with Constant Coefficients 


10.3.1 Homogeneous Linear DE 


In this section, we derive a method for finding solutions of first- and second-order 
homogeneous DEs with constant coefficients. The method is always successful. 


Homogeneous First-Order DE 


We will consider briefly the first-order DE with constant coefficients 
ayy’ +aoy =0 


Rearranging the equation gives 


d 
а =—doy 


or by ‘separating the variables’: 


d 
ee ea dx 
2 a1 
Integrating both sides gives 
lny =— ши + constant 
ay 


For convenience, we can write In C for the constant. The solution is 


y=Ce'l*, where ry = – — 
This type of equation is frequently encountered in, e.g. the decay of radioactive 
substances, the tension of a belt round a pulley, the discharge of a capacitor in an 
electric circuit. 

Note: If it is possible to write any DE with only x terms on one side and only 
y terms on the other, the solution can be obtained by straightforward integration of 


both sides. This is called separation of variables. 
Homogeneous Second-Order DE 
We now seek a general solution of the homogeneous second-order DE with constant 


coefficients, i.e. 
a2 y" -- ai y! +аоу = 0 
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The solution of this equation will contain two arbitrary constants, corresponding to 
two different solutions y, and y2. The solutions must be linearly independent, i.e. 
they cannot be represented Бу y, = Су», where C is some constant, for all values 
of x in the interval considered. The following lemma will be useful for finding the 
general solution. 


Lemma 10.3 /f the homogeneous linear DE 


ary" *- ay! +аоу =0 


has two different solutions yı and y», then the following expression is also 
a solution of the DE: 


y = Су + С2у2 


Cı and C2 may be real or complex quantities. This expression is the general 
solution of the DE. 
Proof We assume that 


азу1! азу +аоу = 0 [1] 
a2y2" +a y2' +аоу = 0 [2] 


Substituting у = Сту + С2у2 іп the DE gives 


а (Су + C2y2)" +а1 (Су + C2y2)' +ао(Сіу + Су) = 0 


Rearranging the terms gives 


Cy (азу1" - a1yi' +4091) + C2(a2y2 +41 y2 + 49y2) =0 


By [1] and [2], both expressions in brackets are identically zero. Hence we have 
proved that 
y=CiyitCry2 


is a solution of the DE. It is the general solution since it contains two arbitrary 
constants. 


We must find two linearly independent solutions yı and y2. Guided by the results 
for the first-order DE, we assume that the second-order DE is solved by functions 
of the type y = e"*. The admissible values for the unknown г are to be determined. 
Table 10.1 shows the systematic procedure and an example. 

The roots of the auxiliary equation will depend on the values of the constants 
а2,а1 and ао. We must therefore examine these roots carefully. There are, in fact, 
three cases to distinguish. 
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Table 10.1 


Systematic procedure for the solution of the Example 
homogeneous second-order DE 


Let the equation be 


азу" t a1y' +аоу =0 y" +3y'+2y =0 
Let y = e"* be a solution of the DE. Substituting 
for 
firs rx LL Pe эх dy - rx 
Уу =re yeu ae ue 
and y" = p2elX PER dy =p2elX 
dx? 
gives apr2e"* +a ,re"* cage" * =0 
We can factorise е”: 
е" (apr? +aır -- ag) =0 e"*(r? 637 -2) 20 
Since e"* Æ 0, the expression in the bracket must 
be zero: 
azr? - a,r - a9 = 0 r243r+2=0 


This is a quadratic in r. It is called the auxiliary 
equation of the DE. Its roots are 


—a| c V a4? — 4а2а0 


T)2 = ry —--h 0 = -2 
й 24» 
Provided that rı and r2 are different, the general 
solution of the DE is 
y- Суе"1х + Сзе'2х y= Сех + Cre 2* 
Let us recapitulate. Given the DE 
азу" +a y' +аоу = 0 
we found the solution to be у= Се"! * 4 Сое'2* 
(10.2) 
! —a, t үа12-4ага 
with 712 = ! : 250 
й 2а» 
Case 1: The expression a4? — 4азао is positive. 
Here the roots are real and unequal. 
Example Solve 2y" 4- 7y' -- 3y = 0. 
The auxiliary equation is 2r? + 7r +3 = 0. 
The roots are rı = —0.5,r2 = —3, and the general solution is 


y- Cel + Cen 


The solutions are combinations of exponential functions. A detailed discussion of 
this type of solution with respect to applications can be found in Sect. 10.4.2. 
Case 2: The expression a1? — 4а2а0 is zero. 
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Here the two roots are equal and so far we know only one solution, namely ех: 


We need to find a second solution. Let us assume that it is of the type у = ue'*, 
where u is some function of x. Differentiating we find 


у! — we" +rue™ y" = ие" 4 2rule^* + r2ue™ 


Substituting in the DE gives 


а] 
use r = ——— Jaou"e'* = 0 
2а2 


The DE is satisfied only if u” = 0, i.e. if u is a linear function 
u = C1 + C2x 
Then the solution of the equation is 
y = (C14 Cox)e'* (10.3) 
As it contains two arbitrary constants, it is the general solution. 
Case 3: The expression a4? — 4aaa is negative. 
Here the roots rı and r2 are complex conjugates. As we are concerned with real 


solutions we must show how complex roots lead to real solutions. 
To simplify, let the roots be denoted by 


ry=a+jb and r2—a-jb 


1 
where gel. add b = —X4a3ao — 412 


2а2 2a2 
The general solution of the DE is then 
y = СүеӨЧ» + Сзе(2—12)х 
= ех (cie? + Сех) 


From Euler’s formula ((9.4) in Chap. 9, Sect. 9.3.1) 


ex = cosx +јѕіпх 
Substituting for the complex exponential gives 
y =e |(Cy + C3) cosbx -- (C1 — Сә) sinbx] 
or y =e**(Acosbx + Bsinbx) (10.4) 


where A = C, + С and B —j(C4 — C5). 
To demonstrate that we can obtain a real solution from this general complex 
solution, we will consider the following lemma. 
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Lemma 10.4 Let the solution of the homogeneous DE 
а2у" +a1y'+aoy =0 
be a complex function y of the real variable x so that 


y = yı (x) +jy2(x) 


The constants a5,a4 and ао are assumed to be real. 
Then the real part yı and the imaginary part y2 are particular solutions, 
and the general real valued solution is given by 


y=Ciyi+Cry2 (10.5) 


with real constants Сү and Сэ. 


Proof According to our assumption, we have 


aan t jy2)" cai t jy2) +ao(y1 t jy2) = 0 
Collecting real and imaginary parts gives 
(a2 yi" +аџу! +aoy1) tj (a2 ya" Tay» +aoy2) =0 


But a complex number is exactly equal to zero if the real and the imaginary parts 
are zero at the same time. Hence 


a2y1" +a1y1'+aoy1 = 0 
and d? y2" +a, y2' +аоу = 0 


From this it follows that both y; and y» are solutions of the DE and, according to 
Lemma 10.3, the general solution is 


y=Cyy1+Cry2 


thus proving the lemma. 
We can now state that if the auxiliary equation of the homogeneous DE has con- 
jugate complex roots rı = a+jb and r2 = a — jb there is a real solution given by 


y = e** (Acosbx + Bsinbx) 
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Example Solve y" +4y’+ 13у = 0. 
The auxiliary equation is r? + Ar + 13 = 0, 
whose roots are ry = —2 + 3j and r2 = —2 — 3j. 
By the above, the general solution is 


Y =e ?* (Ci cos3x + Cosin3x) 
Summary The solution of the homogeneous second-order DE 
азу" +a1y' +аоу =0 
with constant coefficients may be summarised by the following steps. 
Set up the auxiliary equation. 

y" is replaced by r? 

у! is replaced by r 

y is replaced by 1 


The auxiliary equation is azr? +aır + ао = 0. 
Calculate the roots rı and r2 of the auxiliary equation: 


-а1 Œ va? - 44240 


2а2 


712 = 


Obtain the general solution according to ће following three possible cases. 


Case | If r1 Z r2 are real and unequal roots 

у = Cye"l* + Сзе'2* (10.2) 
Case 2 If r1 = r2 are equal roots 

y =e'l* (Cy + Cox) (10.3) 
Case 3 If r1 and r2 are complex roots with 


ry =a +jb and т =a—jb 


10.4 
y =e” (Cı cosbx + Casinbx) ai 
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10.3.2 Non-Homogeneous Linear DE 


According to Lemma 10.2, the complete solution of the non-homogeneous DE is the 
sum of the complementary function and a particular integral. We have learned how 
to find the complementary solution, i.e. the solution of the homogeneous equation. 
We must find methods for obtaining a particular solution. One method, called the 
variation of parameters, which always yields a solution is discussed later in this 
section. The only problem with this method is that it is long-winded. Consequently 
we often tend to find a way of guessing a particular solution. You may think this is 
unsatisfactory, but with practice you will soon appreciate its value. 


Example Find a particular integral of the DE 
у'+у= 5 
One such particular integral is 
Ур = 5 
since yp" = О and ур = 5 satisfy the DE. 
Generally, if the right-hand side of the non-homogeneous DE is a constant, so that 


азу" taiy *aoy = С (ао 70) 


then a particular integral is yp = C /ао, since ур = 0 and yp” = 0. 


Solution by Substitution or by Trial 


Given the DE a2 y" +a, y +аоу = f(x). We wish to obtain particular solutions 
for typical functions f (х), the right-hand side of this non-homogeneous equation. 

The most important cases encountered in practice are those where f (x) is of the 
type C едх C sinax, or C cosax or of the polynomial type. 

If the function f(x) is the sum of two or more types, a particular solution is 
found for each term separately and then these solutions are added. Note that the DE 
is linear! 

Polynomial function / (х) = a+bx+cx? +... іп which a,b,c,... are constants. 

The only functions whose differential coefficients are positive integral powers of 
the variable x are themselves positive integral powers of x. Hence, for a particular 
integral, we assume 

Yp =A+Bx+Cx?+.... 


The degree of the function assumed for ур must equal the degree of f(x), and 
no powers of x can be omitted, even if the RHS of the DE does not contain all 
powers. Substituting yp and its derivatives in the DE and comparing coefficients of 
the different powers of x gives equations for the coefficients A, B, C... 
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Example Find a particular integral of the DE 
y! =3y' + oy =3'— 2x? 
Since the RHS is a quadratic, we assume 
Ур-А-Вх-С x? 


Hence ур = B+2Cx and yp” = 2C 
Substituting in the DE gives 


2C —3(B+2Cx)+2(4+ Bx+ Cx?) =3— 2x? 


Comparing coefficients we find 


for х2, 2C 2-2, С--1 
for x, -6С-28В--0, B=-3 
constant terms, 2C —3B+2A=3, А--2 


A particular integral is 
Ур--2-3Х- x^ 


Exponential function f(x) = Ce^*. 
We have seen that differentiating an exponential gives an exponential. Hence we 
assume for a particular solution that 


ур = Ae?* 


Substituting in the DE, we find 
(а2А2 +а1А +a) Ас! = Сод” 
The unknown factor A is then given by 
Е С 
— à3À? РајА+ао 


If, however, e^* happens to be a term of the complementary function the method 
fails, since a2A2 +a,A + ао = 0. In this case, we can substitute Ур = Axe^*, 
Should this fail because xe^* is a term of the complementary function, then we 


assume ур = Ax?e4*, and so on. 
Example Find a particular integral of 


y" — Ay! - 3y = 5e73* 


3X. 


The roots of the auxiliary equation are 3 and 1. Thus e ^* is not a term of the 


complementary function; hence we assume 


—3x 


yp = Ae 
yp = -34e 
Yp” = 94e ?* 
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Substituting in the DE gives 


[9 — 4(—3) + 3] Ае ?* = 56:35 


5 
that А = = 
so tha 24 
A particular integral is 
5 -3х 
1217 


The complete solution is 


5 
22 3х X4 7 2-3Х 
y = Се" + Coe age 


Example Suppose that the RHS of the previous example was 5e*. As e* is a term 
of the complementary function, we assume 
Ур = Axe” 
yp = A хех + Ae” = A(xe* +e”) 
Yp” = А xe” + Ae* + Ae” = A(xe* + 2e*) 


Substituting in the DE, we have 


(x+2—4x—4+3x)Ae* = 5e* or —2A-25 
5 
H А = – = 
епсе 2 
A particular integral is 
2 
Yp 2 


The complete solution is 
3x x 5 x 
y = Се + Се" — 539 


Trigonometric function f(x) = Ri ѕіпах + R2cosax. 
The differential coefficients of sine and cosine functions are trigonometric func- 
tions also. We therefore assume for the particular integral that 


yp — Asinax + Bcosax 


We then calculate the derivatives, substitute in the DE and compare the coefficients 
of sine and cosine in order to obtain equations for A and B. 

Ifthe complementary function contains terms of the same form, i.e. sinax,cosax, 
the method fails and, as for type 2, we substitute 


y = Axsinax + Bxcosax 
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Example Solve y" — 3y'-- 2y = 7sin4x. 
The roots of the auxiliary equation are гу = 1, r2 = 2 
The complementary function is 


Ye = Cye* + Сое?“ 
To find a particular integral, we assume that 


yp — Asin4x + B cos4x 
yp = 4Acos4x — 4B sin4x 
Yp” = —16Asin4x — 16Bcos4x 


The DE becomes 


—16Asin4x—16Bcos4x—12Acos4x+12Bsin4x+2A sin4x+2Bcos4x=7sin4x 


We compare the coefficients of sin4x and cos 4x: 


—-14A4+12B=7 
—14B—12A=0 
—49 21 
H А = — а B=—. 
епсе 170 ап 85 


The general solution is 


49 21 
у= Спе + Сое? — 1709114 + gs 00541 


Example Solve у” + 9y = ѕіп Зх. 
The roots of the auxiliary equations аге 


= 3) x Je —3j 
The complementary function is 
yc = Стсоѕ3х + C5sin3x 


Since f(x) = sin3x is a term of the complementary function, we assume for a par- 
ticular integral that 


yp = Axsin3x + Bxcos3x 
Thus Yp = 3Ax cos 3x + Asin 3x — 3B x sin 3x + B cos 3x 
and Yp = —9Ax sin 3x + 6Acos3x — 9B x cos 3x — 6B sin 3x 


The differential equation becomes 


— 9Axsin3x + 6Acos3x 
— 9B xcos3x — 6B sin 3x + 9Ax sin 3x + 9B x cos 3x = sin 3x 


10.3 General Solution of First- and Second-Order DEs with Constant Coefficients 289 


Comparing the coefficients of sin3x and cos3x, we find 


1 


В---, A=0 
6 


2 


The complete solution is 


1 
y = Cı cos3x + C2 sin3x — g^ 008 3x 


Method of Variation of Parameters 


Let us consider the linear non-homogeneous DE with constant coefficients. No as- 
sumptions are made about the type of f(x). 


azy" --a1y' +аоу = f (x) [1] 


Let y; and y2 be independent solutions of the homogeneous equation. We then 
know that the complementary function is ус = C1 y1 + C2ya. 
We need to find a particular integral yp. We assume that it is of the follow- 
ing form: 
Yp = Va yı + Vaya [2] 


Vı(x) and V2(x) are two functions of x to be determined. Hence we require two 
equations for the two unknowns, Ут and V2. 

Substituting Eq. [2] in Eq. [1] gives one equation which must be satisfied by Vj 
and V5. We then try to find another equation which will simplify the calculation of 
V4 and V2. Although this equation may be chosen arbitrarily, it must not contradict 
the first equation. Differentiating Eq. [2], we find 


yp = (yi + Vaya) + (Vi yı + V272) 


Ур can be simplified by choosing the second equation for the unknowns V; and V5 
to be 


Vivi + Vaya =0 [3] 
Hence Ур = V1 yi + у [4] 
and yp = V1 yi + Voy? + Vi yi + V272 [5] 


Substituting Eqs. [4] and [5] in Eq. [1] and rearranging, we have 


Vi (a2y] +a1y1  aoy1) + V2(a2y3 ау +аоу2) +a2(V1 y1 + Vay5) = f(x) 


Since yı and y» satisfy the homogeneous DE, the expressions in the first two brack- 
ets vanish. Hence we have 


a;(Viyi + Vay) = f(x) [6] 
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Equations [3] and [6] are really the only ones that concern us. It is these that we 
have to solve in order to find Үү and V5. We now obtain 


yz -f(x)ya р! = f(x) 


= х Wea (10.5a) 
1 az(y1yh—y1y2)?  ? ar(yiyh—yiy2) 


Since yı and y2 are independent solutions of the homogeneous equation, the de- 
nominator does not vanish identically. 

If we denote the expressions on the right-hand sides by gı (х) and g2(x), then 
V4 and V2 are obtained by integration: 


Yi = | six) ах, v= | go(x) d (10.55) 


You may have noticed that this method is somewhat lengthy. 
This is the reason for attempting the trial solution approach first. 


Example Solve y" — y = 4e*. 
The complementary function is 


Ye = Спе“ + Coe * 


ie. yy =e*, yo=e* 
For a particular integral let 
yp = ће“ + Voe* 
First, we compute the denominator of the integrands: 
УлУэ-Ууу =e" (—e *)—-e%e * = —2 
Second, we compute the parameters V; and V2 by integration: 
Үү; = Їнэн dx = ;j sete dx = 2x 
V2 = 1 oon dx = 5 [ае dx = —e?* 


Third, we can write yp explicitly: 


yp = 2xe* — e?*e * =e*(2x—1) 
The complete solution of the DE is 


y = Спе" + Cae * (2x — T)e* 


10.4 Boundary Value Problems 291 


10.4 Boundary Value Problems 


10.4.1 First-Order DEs 


Let us consider the equation ауу! + аоу = 0. The auxiliary equation is ry = 
—ag/d,. The solution is 
y- Селїх 

Since С can take on any value, there is an infinite number of solutions. But we often 
know the value of the function or its derivative at a particular point. For example, 
we might state that for a body in motion its velocity is vg at time 7 = 0. Such 
a condition is referred to as a boundary condition or initial condition, and this fixes 
the value of the constant C. The general solution becomes a particular solution 
because it satisfies a preassigned condition. According to Lemma 10.2 a first-order 
DE contains one arbitrary constant. This constant is determined by one boundary 
condition. 


Example Solve y’+3y — 0, so that when x — 0, y — 2 (i.e. the solution is to contain 
the point x — 0, y — 2). 
The general solution is 
у = Сех 


Substituting ће boundary condition, we have 
2=Ce=C; hence C=2 
Consequently, the particular solution satisfying the boundary condition is 


y = 2ẹe73x 


10.4.2 Second-Order DEs 


The general solution of a second-order DE has two arbitrary constants. We therefore 
require two boundary conditions to calculate their values. These conditions may take 
various forms. For example, the solution might have to pass through two points in 
the x — y plane, or it might have to pass through one point and have a certain slope 
at another. These conditions could be stated thus: 


at х= х, y=y ама! x=x2, у= у 
or dt x=x1, gy Jy andat x=x2, y =y} 
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Example Figure 10.2 shows a cantilever beam of length L supporting a load W at 
the free end. The DE is given by 


EIy'- M —0 


where the bending moment M at a section XX is Wx. The product Е is constant 
and is a property of the beam material and its cross-sectional dimensions. 


Fig. 10.2 


Solve the DE, given that 


when 


0 . 
and when \ boundary conditions 


The DE is Ely” = Wx 
This second-order DE can be solved directly by integrating twice: thus 


wW 2 
Ely'= +01 [1] 
W 3 
and Ely= <= +Cixt+ Cy [2] 


Let us now consider the boundary conditions. Since y’ = 0 when x = L, we have 


WL? 


Cy = 2 


Since y = 0 when x = L, we have 


WL? 


€) --— 
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The desired solution is 


sb WESS WES. WIES 
>= as 2 7 =) 
UM XS, ЭР 

" ГАС 255) 


Example Solve y" + y = 0, given that y(0) = 0 and y(x) = 1 (boundary condi- 
tions). 
The roots of the auxiliary equation are rı =j and r2 = —j. The general solution is 


y = Cı sin x + C2 cos x 
Substituting the first boundary condition in this equation, we have 
0 = Сі sin 0 + C5cos0 = C, x 0+ C5 


Hence C5 = 0 
The second boundary condition stipulates that 


1 = Cı sina + C2 cos) = C5 x 0+ C5 


Hence C5 = —1 

In this example, the two given boundary conditions contradict each other: they 
cannot both be satisfied if the solution is expected to be a differentiable function. 
No differentiable solution exists. 


10.5 Some Applications of DEs 


10.5.1 Radioactive Decay 


Let N (t) be the number of radioactive atoms present at time t. We assume that the 
rate of decay with time is proportional to the number of atoms remaining, i.e. 


If we introduce a factor of proportionality k, bearing in mind that the number of 
atoms is decreasing with time, the DE is 


d 

ac^ 0 = —kN(t) (k>0) 
or N+kN =0 
This is a homogeneous first-order DE whose solution is 


N = Се! 
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To fix C, we need to choose some initial value. For example, let No be the number 
of atoms at time f = ѓо = 0. Then 


No = Ce "o = ce? = С 
The particular solution is N= Noe * : 


10.5.2 The Harmonic Oscillator 


Free Undamped Oscillations 


Figure 10.3 shows a mass m on a spring of stiffness k (load per unit elongation). If 
the mass is pulled down by an amount x from the equilibrium position, the spring 
will exert a restoring force trying to bring back the mass towards that position. 


We 


Fig. 10.3 


By Newton’s second law of motion, 
mxX(t) = —kx(t) 
x+ozx=0, oO, = — 
т 
On = natural frequency 
This is a linear second-order DE. The auxiliary equation is 
r4 we =0 


The roots are rı = jo and r2 = —jan. 
The general solution is (cf. Sect. 10.3.1, Case 3) 


x = C4cosogt + C2 sin wnt 


We need two boundary conditions to determine the values of Сү and Со. 
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For example, the boundary conditions of an oscillation are 


x—0 at f—0 (position at the instant = 0) 
Хэр at 1-0 (velocity at the instant = 0) 


Substituting the first condition in the DE above gives 


0 = С соѕ0 + C5sinO 
Hence Су =0 


Substituting the second boundary condition gives 
X = ug = —@n C1 sin 0 + @n C2 cos 0 = o, C2 


Hence C2 = vo/@n 
The particular solution is 


vo . 
x = — sin wnt 
Wn 


showing that the motion of the mass is oscillatory at a frequency of wn rad/s and of 
constant amplitude vo /@n. 

The general solution of the DE is a superposition of two trigonometric functions 
with the same period (Fig. 10.4): 


x(t) = Cy coswnt + C2sin wnt 


According to the superposition formula in Chap. 3, Sect. 3.6.6, x(t) can be ex- 
pressed in the form 


x(t) = C cos(wnt — о) 


where C= C)? + Cp? 


and a = tan! (2) 


x(t) 


Fig. 10.4 
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Thus, if we start with 
x = C cos(wnt — a) 


as the general solution, we have two unknown constants, C and o. These are deter- 
mined by the boundary conditions as before. 
Since x = 0 at t = 0 and x = vo at t = 0, we have 


0 = C cos(—a) = C cosa 


Since С # 0, it follows that ој = 2/2 огоз = –л/2. 
Differentiating x gives 


X = —ogC sin(ogt — а) 


Inserting the second boundary condition gives 


vo = оС 
Therefore C= cn 
On 


The particular solution is 


Uo л Vo . 
x = — cos (ont Е 2) = — sin Wnt 
On 2 On 
which is identical to the previous solution. 
Finally, we could also have chosen the solution 


x = Cosin(ogt +a0) 


You should verify this for yourself. 


Damped Harmonic Oscillator 


The harmonic oscillator considered above is an ideal case. In reality, friction is 
present in all systems in the form of dry friction, viscous friction and internal fric- 
tion between the molecules in a material. Friction in whatever form slows down 
motion because it dissipates energy in the form of heat which cannot be recov- 
ered. No matter how small the friction is in a system (such as our spring-mass 
system) oscillations will eventually die out. The effect of friction is known as 
damping. 
The friction or damping force is given in some cases by 


Е--схХ 


where c is a friction or damping coefficient, x is the velocity and the minus sign 
indicates that the force acts in a direction opposite to the motion. By Newton’s 
second law, the equation of motion for our spring-mass system becomes 


mi+cx+kx=0 
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This is the DE of motion for free oscillations or vibrations, meaning that there are 
no external forces acting on the system. 
The auxiliary equation is 


mr2=+cr+k=0 


whose roots are 


—c , Vc2—4mk 


he 2m i 2m =P ae 


As we saw in Sect. 10.3.1, there are three cases to consider, these depend on the 
value of c? — 4mk, i.e. 


c?—4mk »0, c?—4mk «0, c?—4mk = 0 


Case 1: c? — 4mk > 0. 
This means that the roots are real and unequal. In this case the general solution is 


х= Спе! + Coe"2! 


=e С + Coe? 


This corresponds to an over-damped system, and its response from a given initial 
displacement is shown in Fig. 10.5. No oscillations are present. The system will 
return to the equilibrium position slowly. 


x(t} 


о t 


Fig. 10.5 


Case 2: c? — 4mk = 0. 
The roots are equal, i.e. r1 = r2 = —a. The general solution is 


х= (Ci Cot)e “ 


The system will return to the equilibrium position more quickly than the system in 
Case 1 but again there will be no oscillations. It is referred to as critical or aperi- 
odic and the damping is called critical damping. Its response from a given initial 
displacement and initial velocity is shown in Fig. 10.6. 
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x(t) 


о t 


Fig. 10.6 


Case 3: c? — Amk < 0. 
The roots in this case are complex conjugate, i.e. rj = —a +jb,r2 = —a — jb, 
with a 0. The general solution is 


xeu С + Coe it 
or x =e *'! [Ci (cosbt + jsinbt) + С(соѕ bt — jsin bt)] 
=e “(Acosbt + B sinbt) 
where A = C, + C2 and B = j(C4 — C2) and A and B are arbitrary. 
We should point out that although Сі and C2 may be complex, A and В аге not 
necessarily complex. As we are dealing with a real physical problem, the solution 
must be real, hence A and В must be real, which means that Су and C5 must be 


complex conjugate numbers. 
The displacement x may be put in another form thus: 


x = Ce “ cos(bt — о) 


An examination of this function shows that the system will oscillate, but the oscil- 
lations will die out due to the exponential factor. Its response from a given initial 
displacement and velocity is shown in Fig. 10.7. It is a damped oscillation. 


x(t) 


Fig. 10.7 
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Forced Oscillations 


The damped oscillator shown in Fig. 10.8 is now subjected to an exciting force 
given by Focoswt. Fo is constant and o is the frequency of excitation, or forcing 
frequency. 

Newton's second law gives 


mł +cx+kx = Fgcosot 


According to Lemma 10.2, the general solution is the sum of the complementary 
function xc and the particular integral xy. We have just examined the three possible 
solutions of the homogeneous equation; it now remains to find the particular inte- 
gral. The simplest approach in this instance is to use a trial solution, as discussed in 
Sect. 10.3.2. Hence we assume an oscillation at the frequency of the exciting force: 


Xp = xocos(ot — 01) 


СА ^ 
Ae 
ROS - 
р o cos wt 
k (spring) 
m (mass) 
€ (damper) 
Fig. 10.8 


Substituting in the DE and comparing coefficients, we find 


Fo 
хо = 
V(k = то2)2 + с202 
5 : wc 
an ano, = ————; 
17 кто? 


The general solution is 


X = Xc+ Xp, Xc = complementary function 


Fo 
ie. х= xcd cos(wt — a1) 
y (k = mo?)? + c2? 
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If there is damping in the system, the complementary function will die out after 
a certain time (known as the transient phase) and the motion will be given by 


Fo 
c 
у (К —mo?)? + c29? 
The system will oscillate at the frequency о of the excitation. This phase of ће mo- 
tion is called the steady state. Figure 10.9 shows the complementary function хс, the 


particular integral xp, and the response of the system from the instant the excitation 
is applied, i.e. х = хс + Xp. 


os(wt — 01) 


x, = Ce cos (bt — a) 


Xp = xo cos (wt — ay) 


Xo 


1 
1 
i 


І Steady state 
png в 
Ї 


Transient 


Fig. 10.9 
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From a practical point of view, the amplitude xo of the steady state is most im- 
portant. It depends on the excitation frequency w. If we vary w we will reach a value 
which will make хо a maximum. This condition is referred to as resonance because 
о corresponds to the natural frequency wn of the system. This maximum is obtained 
by setting 


4хо 

IM а (у 

dw 
which gives 

2 
c 
PEE 2 
m m MOT 

where 02 = k is the undamped natural frequency, and wg is the damped natural 
frequency of the system. 


P d damping 


2 Large damping 


{a) (b) 
Fig. 10.10 


If the system is undamped, then c = 0. We see that the excitation frequency 
corresponds to the undamped natural frequency and the amplitude grows beyond all 


bounds because the denominator in x9 = D. vanishes. This situation is shown 
in Fig. 10.10a which shows the amplitude of the steady state as a function of the 
excitation frequency. 

In practice the amplitude is reduced due to the presence of damping, no matter 
how small, as shown in Fig. 10.10b. The greater the damping the smaller the ampli- 
tude. With a small amount of damping the amplitude at resonance can be very large 
and engineers avoid this situation. 

The following sections offer, in a concise fashion, some further methods of solv- 
ing certain types of DE. In Sects. 10.7.3 and 10.7.4, concepts which have not yet 
been introduced will be referred to, namely ‘partial derivative’, ‘total differential’ 
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and ‘partial DE’. It may be advisable to skip the rest of this chapter during a first 
course and to return to it when the need arises. 


10.6 General Linear First-Order DEs 


10.6.1 Solution by Variation of the Constant 


The DEs discussed so far have constant coefficients, but that is only a special case of 
what we shall start to discuss now. In this section, we are concerned with linear first- 
order equations. First order means that no higher derivatives other than y’ appear; 
linear means that no powers of y and y' and no products like y y' appear. The general 
form is thus 


p(x)y' * a(x)y = f (x) 


The coefficients p and q are arbitrary functions of x and the following are examples: 


1 
yi tea ax?; with p(x)=1, quy a f(x)24x? 


vVxy-y-l, with р(х) = ух, 4(х)--1, f(x)=1 


We will now derive a method for solving first-order linear DEs which relies on the 
method of variation of parameters from Sect. 10.3. A quicker method, using the 
integrating factor, is described in Sect. 10.6.2. 

We have seen how to solve systematically a DE with constant coefficients by 
the method of variation of parameters. Step 1 requires us to solve the homogeneous 
equation, and Step 2 to vary the constant. A general linear first-order DE can be 
solved by a straightforward generalisation of this method. 


p(x)y' * a(x)y = f(x) 


Step 1: Solve the homogeneous equation 


dy 
май -0 
pata) 
dy | _4(*) a, 
y p(x) 
y n ) 
— = ]n = — dx+C 
01--/ 14666) 
fax) 
Hence y= Се Te dy (10.7) 


The function e/ (4/2) 1 — I (x) is called integrating factor, for reasons that 
will soon become clear. In some other references, the integrating factor 
is abbreviated as IF. [I(x)]~! is a particular solution of the homogeneous 
equation. 
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In order to solve the non-homogeneous equation, we will now vary the con- 
stant C . 
Step 2: Let the constant C become a function v(x). Assume that 


y = v(x)/I(x) solves the given equation, і.е. у = v(x)e * 7) 
Compute 
: 1 
y: ун 8) 00001 (v fu) 
I(x) p(x) (х) 10) 
Inserting this into the original equation gives 


ggr) = f(x) 


This equation allows us to compute v(x). Thus 


-fy 2 f(x) 
v) = fva [10975 ах 


The solution of the equation p(x)y' -- q(x)y = f (x) reads 
m 


We note in passing that the general solution of any first-order DE must contain one 
free parameter. In the case under consideration, this is the constant which arises in 
the last integration. 


Example у! +2 = 4x? 
Step 1: The homogeneous equation reads 


y 


y+—=0 
x 
Its solution is 
dy dx 
y Ни 
In|y| 2 —In|x| 4- C1 
€ 
у= = 


Step 2: Variation of the constant С = v(x). 


Assume y= aru 
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Inserting into the original equation gives 


LHS = y'+== = 
y+ х х2 Y x2 x 
RHS = 4x? 
/ 
Thus 2452 4x? 
x 
and v(x) = азах = С 


The general solution of the given equation reads 


C 
3 
х)-х7--- 
y(x) - 
Let us convince ourselves that this claim is correct (direct computation): 


C (S 
o FEER itx 


— = 4x? 
x2 


The DE is indeed solved by y(x). As the solution contains one free parameter 
(namely C) we can be certain that it is the general solution. 


10.6.2 A Straightforward Method Involving the Integrating Factor 


Remember that the integrating factor is the reciprocal of a particular solution of the 
homogeneous equation 
T 400 ах 
I ( x) = el P(x) 

In other words, C /7 (x) solves p(x)y' -- q(x)y = 0. 

The name integrating factor is justified by the following observation. If the given 
non-homogeneous DE is multiplied through by the integrating factor, then the LHS 
can be expressed almost as an ordinary derivative: 


I (x)p(x)y’ +I o)a(x)y = I(x) f (x) 


Observe that 


Thus POLY = 1x) (х) 
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The solution of the equation p(x) y’ + q(x)y = 01 


у(х) = mei 1(57 v) ах (10.8) 


а(х) 
where I(x) = el р(х) х 
For the sake of clarity, let us list the steps necessary for solving a linear first-order 


DE, p(x)y' +q(x)y = f (x), using this result. 
As a preliminary step, identify p(x),q(x) and f (x). 


[Ras 


and write down the integrating factor 


Step 1: Solve the integral 


(x) 
I(x) =e! 905 9 


Step 2: Solve the integral 


I(x) 
[sof dx 


and write down the general solution. The necessary constant emerges because of the 


last integration. А 163 
х 
ve) = a5 | Say fedex 


Example Solve \/xy’—y = 1. 
p(x)-vx, q(x)--1, f(x)=1 


Step 1: -f X --2үх, I(x)-2e ?V* 
Step 2: у(х) = е2Ух f e dx = e2V¥ (e 2V* + C) = Ce?V* —1 


The following example shows that the method just described can also be used for 
DEs with constant coefficients. 


Example The DE for the current in an electrical circuit consisting of an inductor 
L and a resistor R in series is given by 

di T R, E. 

— + =i = — sinw 

dí L L 
where Е sinwt is the voltage applied to the circuit. Solve the equation and discuss 
the solution. 
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i(t)= gr (2 [esos arte) 


The integral has to be evaluated by parts. 
(Remember that f u dv = uv — f v du.) 
This integral is, leaving out the constant 
Le(R/L)t 
R? + @? L? 
Step 3: The solution of the DE is 


(Rsinwt — oLcosot) 


E 


i= CREF uuu (Rsinot - wLeoseot) + Ce UP 

m 

E 
or i= sin(wt — æ) + Се (R/ Dt 

VR? co? L? 

wL 
h = tan"! | 
where a = tan (==) 


As t increases, the last term decreases and the current 7 tends to a steady 
periodic value. 


We conclude this section with a word of warning. The process of first determin- 
ing the integrating factor and then the general solution of a linear first-order DE 
is guaranteed to work in principle but not always in practice! The snag lies in the 
annoying fact that a given integral may not have an elementary solution. Thus it 
may well prove to be unavoidable to resort to numerical methods, even in cases of 
presumably innocuous DEs. 


10.7 Some Remarks on General First-Order DEs 
10.7.1 Bernoulli’s Equations 


The general Bernoulli DE for arbitrary n is 


у -q(x)y = f(x)y" (nz1) 
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Note that n may also be negative, but it must not be unity. A Bernoulli DE can 


be converted to a first-order linear equation by means of the substitution 
X 1 


Then u’/(1—n) y” = y',uy” = y; hence Bernoulli's equation becomes 


u'y” 4 q(x)uy" = f(x)y" 


l-n 


Dividing by y” gives 


ul +q(x)u = f(x) 


This is a linear first-order DE. Its solution has been shown in the preceding section. 


l-n 


Example у’ xy = —y?e-* 
This is a Bernoulli-type equation with n = 3. We put u = y ?, y = uy?. 
Hence 

/ 


1 13 
y 7739 Y 
Inserting these into the given DE gives 


1 
-5 у? —~xuy3=—y3e-* и +. 2xu = 2e 
This is a linear first-order DE for the function u. 
The integrating factor is (x) = ех . The solution for u is 


u(x) = Emu dx 22xe Y Сет 


After substituting this into y — u-V/?, we obtain the solution in its final form: 


1 1 х2/2 
y = —= = — e 
vu f2x +C 


10.7.2 Separation of Variables 


If the equation is neither linear nor of the Bernoulli type, then we may still be able 
to solve it using only elementary tools. The simplest case is when the equation can 
be rewritten with only y terms on the LHS and only x terms on the RHS. The DE 
is said to have separable variables when it can be written in one of the following 
equivalent forms: 
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The solution of such an equation is obtained by simple integration: 
Јо) а= = |а) ах=с 


Example The variables in the following equation can be separated: 


yx? = 2y? 
Dividing by x? y?, we obtain 
| j4.-2 : 1 2 
gj t 1.е. 32 rit 


This is an equation of the type required with p(y) = 1/y? and q(x) = —2/x?. 
Now, straightforward integration gives 


jr l 
ШИРЭЭГ, 
y x? iy 
dh х? 
an: ence = = 
> Cx2+1 


10.7.3 Exact Equations 


If, in a given DE, the variables cannot be separated, there is still a chance of finding 
an easy way to solve the equation. We must, however, refer to the basic concepts 
of partial derivative and total differential which are covered in Chap. 12. Logically 
speaking, it should be read beforehand. 


Definition Let p(x,y) dy+q(x,y) dx =0. 
If the following condition holds then the DE is said to be exact: 


др q 


ox ду 


Example 2хуу' + y? = x? 
This can be rewritten as 
2xydy + (y? - x?) dx =0 
We identify p(x, y) and q(x, y), so that 
P(x,y)=2xy, (х,у) = y?—x? 
Let us now check whether the condition of exactness holds: 


Op | Op | 
сл oy 


Hence the given equation is exact. 


2y 
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The LHS of an exact DE can be considered as the total differential of some 
function F(x, y): 


pdy+qdx=dF= e Mp ахар 
ду дх 
The equation is therefore solved by the functions y(x) which are defined implic- 
Шу by F(x,y) = С = constant. The obvious question is how a suitable func- 
tion F(x, у) may be found for any given exact DE. We will describe in general 
terms a method for finding such a function. Later we will refer back to the example 
just given. 
The starting point can be either one of the following equations: 


OF OF 
Y р(х,у), РРБ q(x,y) 


Let us choose the first one. 


Step 1: From this first equation we find, by integration, that F = f p(x, y) ау +С. 
The constant of integration C is an, as yet, undetermined function of x only, 
i.e. C — v(x). The reason is that any such function vanishes if the partial 
derivative д/д y is taken. 

Step 2: In order to determine v(x), insert F into the equation d F /dx = q(x, y). 
This yields a differential equation for v(x), i.e. 


OF д 4 
зо 5; f pœ») dy + grw = q(x,y) 


Note that since v(x) is a function of x only, the partial derivative 0 / (0 x)v(x) 
equals the usual derivative d/(dx) v(x). 


п) = f la) - f pon) ay] s 


Step 3: Insert the result of the last integration into the equation for F. 


Exact DE p dy +q dx = 0. It is solved by functions у(х) which are given 
implicitly by F(x, y) = С = constant. 
The function F(x, y) can be obtained in either one of two ways. 


If we start with the equation oF = p, then the formula reads 
д 
Р-Гр»д-| а 5 | рау ах (10.10а) 


If we choose to start with the equation ae = q then the formula reads 


r= [аа | |0 5 Јаз) ау (10.106) 
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We will now solve the equation given in the previous example; it has already 
been proved to be exact. The equation is 


2xyy + y? =x? 

Step 1: 

р(х,у) = 2ху 

Е = [oy dy = xy? + v(x) 

Step 2: 

4(х,у)--у?-х = a -y v) 

v(x) = SE =-— 

Step 3: 


x 
F(x,y) cay ES 


Therefore, the general solution of the given DE is 
x? x C 


xy-7R-C, Le. E ben 


It is not hard to verify that these functions do indeed solve the equation. 


Example Solve the DE 
d 
yor +х= +y x2 + y? 
Even though this is not an exact equation, the notion of exactness aids us in finding 
solutions. 
Rearranging the equation we have 


хах + уйу 
ty/x2+ y? 


Inspection reveals that ће LHS is a differential, i.e. 


d (+Vx?+y?) =dx 


=dx 


By integration, we find 


t\/x2 4+ y2 =x+C 


y? =2Cx+C? 


Squaring gives 


This is the equation of a parabola. 
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10.7.4 The Integrating Factor — General Case 
If the given DE is not exact, then sometimes it is possible to turn it into an exact 


equation by multiplying it by a suitable function u(x, y). This function is also called 
an integrating factor. 


Example Suppose the given DE is 
(xy - y y? 20 


If it were exact then we would have 


Bp = gd 
дх oy 
Now, p = xy — І and q = y?, so 
дру 3, 
Ox Q y 


It does, however, become exact if it is multiplied by u = 1. 


y 
The DE (x — sy ty = 0 is exact. 


Proof p — x — + апі = y, so 


9p .j..94 
ox | Oy 


This equation can therefore be treated as described in Sect. 10.7.3. 
How can an integrating factor u(x, у) be found in the general case? In order to 
provide the answer, we must formulate the problem mathematically. 


др , 94 
ax” dy 
Wanted: ш(х, y) such that epu) = 9090) 
Ox ду 
If an integrating factor jz exists, then it must satisfy the following condition which 
derives from the equation above and from the product rule: 


1 


Given: p(x,y)y' 4- q(x, y) =0 with 


Op ди. 94 ди 
Oe ICE Oy ay, 


This is a partial DE, and it would seem less easy to solve a partial DE when it is 
not possible first to solve the ordinary DE. However, we do not need the general 
solution of the partial DE: any non-zero particular solution u will suffice. 

Even though we can offer no general advice on how to find the integrating fac- 
tor u, there are two important special cases in which u can be readily obtained. 
Below we state these without proof. 
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Special case 1 


pt LR = f(x) is a function of x only, then u = ef / 6) 4, 
p\oy Ox 


Special case 2 


1/9 д 
Р 1) = g(y) is a function of y only, then и = ef eo) dy, 


Example Let us return to the equation encountered in the last example, i.e. 
(xy - Dy 4 y? 20 


Both 9р апа tt are functions of y, as is g. Therefore from special case 2, we get 


g() =-5 ; во) =—In|y|+C 


The function p(x, y) = u(y) = e- ^»! = 1/|y| is an integrating factor. u has al- 
ready been used above as an integrating factor. 
Finally, we can solve the equation: 


1 
(s->)y'+y=0 
Ў 


It is exact, апа we must now find a function F such that 


The solution is obtained by the method outlined in Sect. 10.7.3. It reads 
F(x,y) = xy -n|y|- € 


Therefore, the general solution of the given equation is the class of functions which 
is given in implicit form by 
xy-In|y|2 C 


Let us note in passing that the method outlined in this section is a generalisation of 
the technique covered in Sect. 10.6.2. You are invited to prove for yourself that if 
a linear first-order DE is treated as proposed here, then u(x, y) = u(x) = I (x). 

Hint: Use the normalised form of the equation, which means that it has been 
divided by the first coefficient: 


(х) f(x) 


P(x)” p) 
It is, admittedly, not altogether satisfactory that we are unable here to present a more 
general procedure for finding the integrating factor u. For a more extensive treat- 
ment, the reader is referred to the standard treatises on DEs. 
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10.8 Simultaneous DEs 


Often problems arise involving several dependent variables. They give rise to a set 
of differential equations. The number of equations corresponds to the number of 
dependent variables. 

We will restrict ourselves to the solution of simultaneous first- and second-order 
DEs with constant coefficients and two dependent variables. Before considering 
practical problems, let us look at the form of the equations. 

Let x and у be the dependent variables and f the independent variable. These 
quantities are related by means of a set of simultaneous DEs such as 


d 
Ar tay +bx = f(t) 


ау (ау 
А 
di? Tr dt 


dx 
B = e(t 
+ Ba +Cy = g(t) 


where a,b, A, B and С are constants and f(t) and g(t) are functions of the inde- 
pendent variable ¢ only. 
To illustrate the method of solution, consider the examples that follow. 


Example Solve 


dx 

—+5x-—3y = 0 1 

di 5x —3y [1] 
dy 
— +15х – 7у = 0 2 
di 5x— 7y [2] 


First method If we want to solve for x first we must eliminate y and dy /dt from 
these equations. 
Differentiating Eq. [1] with respect to t gives 


d?x dx dy 
5 3 
dt? S df df 


Inserting the expression for ay from Eq. [2] and the expression for y from Eq. [1] 


we obtain 3 
ах dx 
— —2—+10x = 0 
dt? de * 
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We know how to solve this equation. Its solution is 
x = e'(Acos3t + Bsin3r) 
We can now obtain the solution for y from Eq. [1] or Eq. [2]: 
y =e'|(2A + B)cos3t + (2B — A)sin3t)] 
You should note that there are only two arbitrary constants. 


Second method We saw earlier that to solve a DE we assumed a solution e”* and 


found the values of r which would satisfy the equation. There is no reason why we 
cannot use the same method for simultaneous DEs. 

Hence let x = ае’, y = be”. 

It follows that 


Substituting in Eqs. [1] and [2] we have 


[(r + 5)а —3bJe" = 0 
[15a + (r – 7)b]e"* = 0 


Since е”? Æ 0, it follows that 


(r+5)a—3b = 0 [3] 
15a c (r— 7)р = 0 


= 


To calculate the value of r we must eliminate а and b. Hence 


(r+5)(r—7)+45=0 
or r?—2r+10=0 


This is the auxiliary equation we have met before. Its roots are 
ry =1+3j) and /2-1-3) 
x is then given by 


x =e! (a1e + ane 37) 


or x = ef (A; cos3t + А2 ѕіп 3/) 
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Similarly 
у= е! (bie bae Sit) 
or y = e' (B4 cos3t + B5 sin3t) 
b; is connected with ат, and bz with az by Eq. [3] or Eq. [4], i.e. 


USER. b ERES 
== 3 1 2 3 2 


We now consider two further examples taken from electrical and mechanical engi- 
neering. 


by 


Example Two electrical circuits are coupled magnetically. Each circuit consists 
of an inductor and a resistor; a voltage is applied to one of the circuits. Applying 
Kirchhoff’s law to each circuit, the equations relating the two currents іу and i5 
(measured in amps) are 


HA «eli 

PH y eU n d SE 
Vat dí 111 1 
diz di 

i. gap e qm zu 
? di dn one 


Lı, Lo are the values of the inductors, in henries. А, R2 are the values of the resis- 
tors, in ohms. M is the coefficient of mutual inductance, in henries. E, is the applied 
voltage, assumed constant in this instance, in volts. The independent variable t is the 
time in this case. 

Proceeding as in the previous example, we assume an exponential solution for 
the complementary function, so that 


ij = Ael! , in = Belt 
Substituting in the DE we find 


(Lir+ Rı)A+ MrB =0 
MrA+(Lor+ R2)B = 0 
Eliminating A and B from these two equations gives the auxiliary equation, i.e. 
(Lir+ Ri)(Lar 4- R2) —-M?r? 20 
ie. (LiL2 - M?) r? c (LiRo  LoRi)r - ЕЕ = 0 
The form of the solution will depend on the nature of the roots of this equation. We 


saw in Sect. 10.3.1 that the roots can be (1) real and unequal, (2) real and equal or 
(3) complex conjugate. 
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If Case 3 applies, for example, the solution is 


E 
ір — e “'(A,cosbt+ Ву sin bt) + a 
1 


E/R; is the particular integral and i5 = e 2 (A5 cosbt + B2sinbt). Remember 
that Az and A1, B» and P, are related. 


Example This example concerns the calculation of the translational natural fre- 
quencies of a particular two-storey building whose idealised mathematical model 
leads to the following DEs: 


150003, +31 x 107x1 —6 x 107x4 = 0 
8500x5--6x 107x4 —6x 107x, = 0 


where Хү and x» are the displacements of each floor under free vibration conditions. 
The dot notation, as you will remember, refers to differentiation with respect to 
time f. 

The method of solution in this case is approached in a different way from that of 
the previous examples because of the vibratory nature of the problem. 

We could assume an exponential solution as before. 

Instead, let 

X1 = Aycos@pt, X2 = Аз cosogt 


Therefore 
Хүэ =e Ay COS@pt, Хә = —w? A3 COS pt 


€ is the natural frequency. We are using this method of solution because a vibration 
can be represented by a sine or cosine function. We have already discussed some 
aspects of vibrations in Sect. 10.5.2. 

Substituting in the differential equations we have, after dividing by 10^ 


(—1.5@2 +31 x 10°) 4; —6 x 10045 = 0 
—6 x 10241 + (-0.8502 + 6 x 107) 45 = 0 


Since we are concerned with the values of the natural frequencies wn, we eliminate 
A and А2 from these two equations. Hence 


(31 x 105 — 1.592) (6 x 10? — 0.8502) — 36 x 10° =0 
Expanding and collecting terms gives 
1.27504 — 35.35 x 10302 + 150 x 10° = 0 


known as the frequency equation, whose roots are w? = 22495.7, 02 = 5229.75. 
Therefore the two natural frequencies of this building are 150 rad/s ог 23.87 Hz апа 
72.32 rad/s or 11.5 Hz. 
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As a matter of interest, this was a real problem! Dangerous vibrations were ob- 
served when the building was commissioned due to a fan operating on the first floor 
at a frequency of 12 Hz. 


10.9 Higher-Order DEs Interpreted as Systems 
of First-Order Simultaneous DEs 


Any DE of order n can be transformed into a system of n simultaneous first-order 
DEs. In fact, this is no more than a way of rephrasing the problem: it does not 
lead us any closer to a solution. However, it can be quite useful in a number of 
circumstances. If we wish to solve a DE by numerical methods, we find that first- 
order DEs are much easier to handle than higher-order DEs. 

Let us start with a linear second-order DE 


py" +qy' +ry = f (x) 


The fundamental idea is to introduce a new function, u = y’. The given equation is 
then equivalent to the following pair of first-order simultaneous DEs: 


pu'+qut+ry = f(x) 
y =u 


The general case is treated quite similarly. Given a DE of order n, we introduce n — 1 
new functions, u1 = y’, u2 = ui! = y" ,u3 = из! = y" ,... By inserting the us for all 
higher-order derivatives of y, a first-order DE (for the n functions, и1,---,Ми-1,У) 
is obtained. In conjunction with the defining equations for the us, we have a system 


of n simultaneous first-order DEs. 


10.10 Some Advice on Intractable DEs 


The sorts of differential equations discussed in this chapter can give only a glimpse 
of the subtle ideas involved in this topic. Sometimes a judicious change of variables 
may provide the answer, but each equation requires individual attention. Should you 
be confronted with severe problems, two routes can be taken. 

On the one hand, the remedy might be provided by more powerful theoretical 
means. A very worthwhile subject, which could not be included in great detail in 
this book, is the theory of Laplace transforms. This is outlined in the following 
chapter. 

On the other hand, if you are only interested in numerical data, then computer 
methods can provide the answer swiftly and reliably. 
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In any case, the first step when encountering a new DE is to classify it according 
to the criteria: What is its order? Is it linear? What types are its coefficients (constant 
or variable)? Is it an ordinary DE or a partial DE? Only then will you be able to use 
other sources of information efficiently. 


Exercises 


10.1 Concept and Classification of Differential Equations 


1. Which of the following are linear first- and second-order DEs with constant 


coefficients? 

(а) y' 4- x?y 22x (b) 5y^ —2y'— 4x —3y 
(c) y^42y" +3у' 20 (d) sinxy"— у= 0 

(е) ух? = 2 (f) 2y" yt Sy =0 


2. Which of the following are homogeneous and non-homogeneous DEs and 
what is the order in each case? 


5 2 1 

Й = D meu SS 

(a) y" cax 20 (b) 37 t3" 27 
(c) 2y' = Зу 297032 


1 
4 Й D : 2 
(4) 107 ted gy sinx = 0 
(е) 3y" +у' = 2y 
10.3 General Solutions of First and Second Order DEs 
with Constant Coefficients 


3. Solve the following DEs. In the case of complex roots give the real solution. 


(а) 2y" — 12y' -10y 20 (b) 4y" —12y'+ 9y =0 
1 5 
(с) у" +2у'+5у = 0 (d) oe) ЭР ey >? 
1 1 
(е) p 1957 -2y-0 (f) 5y" —2y’+y =0 


4. Solve the following DEs: 


1 
(a) 2y'+8y =0 (b) y= 6y (c) 3y' = 6y 
5. Obtain the general solution of ће following second-order DEs: 


(a) S"(t) = 2t (b) x"(t) = —o? coswt 
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6. 


7. 


8. 


9. 


Given the following non-homogeneous DEs, obtain the particular integral 
using a trial solution. 


(a) уу +у 22x -3 (b) y" +4у'+2у 22x -3 
Obtain the general solution of the following non-homogeneous DEs: 
(a) Ty” — 4y' - 3y =6 (b) y" — 10y' - 9y = 9х 

(c) 3" y'-4y = x? (d) y" +2y'+5y =cos2x 


A particular integral ур(х) of the following non-homogeneous DE is known. 
Check that it is a solution and obtain the general solution of the DE. 


I. OE NEC LS 28.18 43 

gy 30 tga 45 -Lb іх) = 79% 45559 124 
10.4 Boundary Value Problems 
Solve the following DEs: 


10. 


11. 


12. 


1 
(a) 3» + 2y =0 (given that y(0) = 3) 


4 
(b) 777 $y =0 (given that y(10) = 1) 


The DE 1/3y' — 2/3y = 0 has for its general solution y(x) = Ce?*. Calcu- 
late the value of the constant if 


(а) y(0) =0 (b) у(0) = –2 
(c) y(-1)=1 (d) y'(-1) 22e? 
Solve y" + 4y = 0 for the following boundary conditions: 

л л (NL 
@ у(0)-0, »(2)-1 © ¥(F)=-1, У(52)-1 
(с) у(0)-0, y'(021 Ф)(2)-4, у"(0) = b 


Solve y" + y = 2y’, given that y(0) = 1 and y(1) = 0. 


10.6 General Linear First Order DE 


13. 


14. 


Solve the following first-order linear DEs: 


Y ( у= 24x 
X X 


(с) у’ + ytanx = sin2x (d) ху +(1+х)у= xe^* 


(а) ху = 2у - x 


Verify that ће following DEs can be brought into the form of Bernoulli-type 
equations and solve them. 


2 


5 
(a) y! xy = xy? (b) »'— y=-y 
x2—1 


2 
» +x+1=0 


(с) x?y?y' c xy? = 1 Е 
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15. 


16. 


17. 


18. 
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In the following DEs, the variables can be separated. Solve 
(а) y'= e-2 (b) y xy 5 —0 
y 
(с) ху'+ (Inx)y? =0 (d) (y)? + xe* y' - xe* = 1 


Verify that the following are exact DEs. Find F and solve. 


2) 2 
(a) Zya (a- 5) =0 (b) (1—xe~”) y/ +e =0 
(с) (2y ^ x?sin2y)y' 4-2xcos? у = (d) (2x —3)y'- 3x2 -2y 20 
0 


You will remember that the integrating factor u(x, у) fora DE р(х, y) y! + 
q(x, y) = Ois easy to find in special cases. Solve the following equations by 


finding an integrating factor, ш, and then solving the exact equation. 
(a) sin yy! — cos y = —e?* 


(b) (е? —x)y’+1=0 
Solve the following simultaneous DEs: 


(a) x'—7x4y-0 (b) x’+ y'+2x+y=0 
y'-2x—-5y-0 у'+5х+3у = 0 


Chapter 11 
Laplace Transforms 


11.1 Introduction 


In Chap. 10 we learned how to solve certain differential equations of the first and 
second order. We now consider a special technique for the solution of such ordinary 
differential equations known as the Laplace transform. It was first introduced by 
the French mathematician P. S. de Laplace in about 1780. The main advantage of 
the method is that it transforms the DE into an algebraic equation which, in many 
cases, can be readily solved. The solution of the original DE is then arrived at by 
obtaining the inverse transforms which usually consist of the ratio of two polyno- 
mials. The transforms and their inverses can be derived or obtained by consulting 
a table of transforms. We shall build up such a table of the functions frequently 
met in practice. The method is particularly useful in the solution of DEs whose 
boundary conditions are specified at a particular point and it is extensively used in 
the study of electrical networks, mechanical vibrations, impact, acoustics, structural 
problems, control systems, and in many other fields. 

It is also used to solve linear DEs of any order, linear DEs with variable co- 
efficients, linear partial DEs with constant coefficients, difference equations and 
integral equations. 

In this chapter, however, we shall restrict ourselves to an introduction to the tech- 
nique, and solve first- and second-order DEs with constant coefficients. 


11.2 The Laplace Transform Definition 


The Laplace transform .[ f (1)] of a given original function f (t) for values 
of t > 0 is defined as 


0] | е0) а = FG) (11.1) 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
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This means that we take the given function, multiply it by е $, and integrate 


between the limits f = 0 to f = oo. 5 is a number which may be complex but whose 
real part is positive and sufficiently large to ensure that the integral is convergent. 
Since the value of the integral depends on s, the Laplace transform is a function of s. 

As the reader certainly knows any functions may be denoted in different ways, 
say f(t) or y(t); f(x) or y(x), etc. In addition to that, please note that for the 
Laplace transform we will use two different notations, either by capitalisation e.g. 
F (s); Y (s) or by adding a bar to the function e.g. f or y. 

Proceeding in the opposite direction, i.e. finding the original function from 
a given Laplace transform is called finding the inverse transform. The inverse trans- 
form is denoted by .Z!. 

The inverse Laplace transform generates the unique original function from 
a given Laplace transform 


£ [F(s)]- f(t) 

Since the computation of the inverse transformation needs knowledge of theory of 
functions which is beyond the scope of this textbook we will desist from computing 
the inverse transformations. This does not limit in practice since as a rule there exist 
tables of Laplace transforms and especially tables of inverse Laplace transforms 
which enable us to solve problems. The table at p. 333 gives the inverse Laplace 
transforms for most functions needed for applications in physics and engineering. 

Before we can appreciate the usefulness of the Laplace transform, we need to de- 
rive the transforms of some of the more common functions encountered in physical 
problems. 

These functions are 


. A, a constant. 

. e% , an exponential function with the constant а real or complex. 

. Sinwt,coswt, periodic functions where w is usually a frequency. 

. At, a linearly increasing function where, in practice, ¢ is usually the time. This 
function is known as a ramp. 

5. tsinwt,tcoswt, periodic functions whose amplitudes increase linearly with the 

independent variable f. 
6. e” sinc, e^ coswt, an exponentially increasing or decreasing oscillation, de- 
pending on whether a is positive or negative. 


ошо м re 


We also need to know (ће transforms of the derivatives of these functions: 
d" y 
dt” ' 


n=1,2,... 


11.3 Laplace Transform of Standard Functions 


We will now derive the Laplace transforms of the functions mentioned above. 
When evaluating the transforms, i.e. solving the integrals, the quantity s is re- 
garded as a constant. 
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1. y(t) = A, a constant 


2? es! 99 A 
y(s) = | л а=а[ e == (11.2) 
=S 10 5 


2. y(t) =e% , with a real or complex 


oo 2: -(s-a)]^ 
у) = | est ей! a= | e- 6-9! qr = | Ê E (11.3) 
0 0 —(s—a) o 574 


Note that s > real part of a for the integral to be convergent. 


3. у(1) 2 sinot and y(t)= coswt 
The simplest way of obtaining these transforms is to make use of 2. 
Remember: 


1. . 
t)=si f= — e?! _ pa jot 
y(t) =sinw 37 e ) 


Hence, from 2., we have 


1 1 1 w 
у(5) = = 11.4 
Уб) 2) (= =) 52 +? oe 


We obtain the Laplace transform of the cosine function in the same way. 


ic 
y(t) = coswt = x" +e J!) 


F(s) = 


s 
52 +w? 


= Ate 5! а-АГ te 5 dt 
1 А 
p Е zin ZI Е 5 (11.5) 
5 0 S JO 


when integrating by parts. 
The first term is zero since e 


t — оо, 


(11.4b) 


^5! decreases more rapidly than f increases as 


Before proceeding further with transforms of functions, we will consider some im- 
portant theorems, the first enables us to extend the list of transforms. 


Theorem I: The Shift Theorem 


If y(t) is a function and y(s) its transform, and a is any real or complex 


number, then (s +a) is the Laplace transform of e “ y(t) . (11.6) 
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Proof The Laplace transform of e~™ y(t) is 


fete) dt = ] 9*9 dt = f(s +a) 


Thus we see that we simply replace s by (s + а) wherever s occurs in the transform 
of y(t). If a is negative, then we can show that s is replaced by (s — a). 
As an example, let us find the transform of 


t 


y(t) 2 e *' sinot 


The transform of the sine function is given by y(s) — E 
Applying the shift theorem gives 


(0) 


y(s) = ——.,—_; 11.7 
») (s +a)? 4+ 02 doi 
Similarly, if y(t) = e~ cost, the shift theorem gives 
S+a 
y(s) = 5—5 11.7b 
») (s +a)? + 9? ( ) 


Example Obtain the Laplace transform of y(t) = 3e?' cos 10r. 
a=-5, 010 
8-5 2, 3(s—5) 
(5— 5)24+102 52 — 105+ 125 


У(5) = 3 


Let us now continue with the derivation of the Laplace transforms of frequently used 
functions. 


Э: y(t) = tsinot 
2 205 
Е > ТИЕ нэ 
¥(s) = n e 5 t sinot dt = (52 +.02)2 (11.8) 


Even though the result is already stated, we wish to know how it is arrived at. 
Looking at the integral, you will realise that the task of evaluating it is not a straight- 
forward one. The following theorem will be of considerable help. 


Theorem II: Transform of Products гу (7) 


If y(t) is a function and y(s) its transform, then the transform of the new 
function ty (t) is 


Ziya] = = <[7(8)] (11.9) 
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Proof 


Sb = (Гез аг) = [eoa = 21000) 


This is due to the fact that we can differentiate under the integral sign with respect 
to a parameter. 
Let us go back to y(t) = tsinct. The transform of the sine function is known 
to be 
-Z'(sinot) = ———; 
( ) 52 o? 


Hence, by theorem II, the transform у (5) of t sinwt 18 


: 4 w 2ws 
шашин ds E ~ (82 + 02)2 


Similarly, the transform of t cos wt is 


d 8 s2— 2 
-Z (t cosot) = Ч (222) = RENE (11.10) 


We can extend the use of theorem II. For example, 
d 
L(t? coswt) = LY[t(tcoswt)| = Nr {Z (tcosot)) 
5 
d ( 52—02 N  2s(s?—o?) 
( 


ds V(s2--92)?/ — (s? 92)? 


If f(t) is a function апа (s) its transform, then the transform of the new 
function is y(t) = t" f (t) 


(s) =(-1)" IF (9) (11.11) 


6. y(t) — t", where n is a positive integer. 


eo ! 
E нэ —st,n 211 


As before, the result is already stated but the proof is missing yet. We start by writing 
the function ¢” as a product: 


y(t) 2 t"-'t 
The transform of the function t is known to be 1/52. 
Now, using the general result (11.11) we find 


_, dt! 1 E 1 n! 
саа, (=1)”=1(—2)(—3)...( M gun 


487-1 s2 


ӯ(8) = ( 
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For example, if y(t) = 12, then y(s) = 2/s?. 
For the sake of completeness, we will mention another theorem which, in fact, 
has already been used implicitly. 


Theorem III: Linearity 


Linearity of the Laplace transform 
Let y(t) be a combination of functions: 


y(t) =Af(t)+ Bg(t) (А, В are constants) 


Then the Laplace transform is the corresponding combination of the trans- 
formed functions: 


y(t) = Af (t) + Bat) (11.12) 
or LIAL E) + Bg(t)] = AZIA] + BL[e(0)] 


The proof is obvious. It follows from the linearity of the integral. 
As a particular case, note that a constant factor is preserved by the Laplace trans- 
form: 


PIAFO] = A-Z|f O] 
For example, the transform of sin wt 18 w/(s* +w?) and the transform of t is 52 
Therefore, the transform of —6sinwt + t is —6w/(s* +02) + 29 


Theorem IV: Transforms of Derivatives 


First Derivative of a Function y(t) 


$ (nf em 
ЕЕ ot di 


We find, putting a = ý i.e. using a dot to denote the derivative with respect to 
time that 


By definition 


Í, еу = 1-1 y(75e 7?) dt = —y(0) +59 = 55 — y(0) 


This result holds for those functions for which e^! y(t) — 0 as t — ee. 
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2 а) -s$(9)—»(9) (1.13) 


where у(0) is the value of the function at t = 0 (initial value or initial condi- 
tion). 


Second Derivative of a Function y(t) 


2 Е 5 22 
262 -| eS ý dt = pev] esf e 5 y dt 
= —}(0) – sy(0) +575 
Zï] = s?y — sy(0) – (0) 


where (0) is the value of the first derivative at t = 0. 
By repeating the process we can show that 


L(Y (t)] = 525 —s?y(0) — (0) — »(0) 


where y(0) is the value of the second derivative at t = 0. 
The following notation for the values of the function y(t) and its derivatives at 
t = 0 15 commonly used: 


уо = y(0) is its value att = 0, 
yi =y(0) is the value of the first derivative att = 0, 


y2 = y(0) is the value of the second derivative att = 0, 


yn = y™ (0) is the value of the nth derivative at t = 0. 


For example, the Laplace transform of the 4th derivative will be 


[y 9 (0)] = s*y —s3y9 —52y1 — sy2 — уз 
Transforms of Derivatives 


[y (no s"y- Y shy; (11.14) 


A table of transforms, and a table of inverse transforms, will be found in the 
appendix to this chapter. 
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11.4 Solution of Linear DEs with Constant Coefficients 


Suppose we have to solve the DE 


with initial conditions 
у= уо, 2252 at 1-0 
If we multiply the equation throughout by e ^^ and integrate each term for t = 0 
to t = œ, we in fact replace each term by its Laplace transform. In doing so we 
transform the DE into an algebraic equation in terms of the parameter s. 
Using the table of transforms, we find 


s2y—syo— yi A(sy — yo) -By 5 f 


5 


Solving for y gives 

f + ѕуо + Ayo + ут 
52 А5 + B 

All we need do now is look up the inverse transform. The reader will notice that we 

do not have to find the values of arbitrary constants; but we may have to express y 

as a partial fraction or in a form from which the inverse can be found easily. 


v= (11.15) 


Example Solve the equation 


dy -21 
< 44у = 
qr +4y =e 


given that у = 5 when t = 0, i.e. уо = 5. 
The transformed equation is 
y +4y : 
sy— =o 
у= yoT ^y PEE 
Solving for y gives 
zo wtll _ 1 J. 9 
У (6--2)(8--4) 2654-2) 2(44) 


From the table, we can look up the inverse transform. Hence the solution is 


"mr 
y= 29 + 29 
Example Solve the equation 
y+5y+4y=0 


given that y =O and y = 3 at t = 0. 
From the table we find 


11.4 Solution of Linear DEs with Constant Coefficients 


s*¥—syo—yitd5(s¥—yo)+ 47 =0 
———— —— Уо 
L(Y) 2(5ӱ) 224») 


The initial conditions аге yo = 0, y; = 3. 
Hence 


25— y y= 
78 527—3 +559 +49 = 0 


У(52--58--4)-3 


Solving for y gives 
3 3 21 1 


4524-5444 (54448401) (841) (544) 


y= 


From the table we get 


Example Solve the equation 
y+8y+17y=0 if y=0, у-3 at t=0 
The initial values are уо = 0, y, = 3. Hence the transformed equation is 


s2y -34-8sy -17y =0 


Solving for y gives 
3 3 


52-85-17 (4241 


y E 
From the table we find 
y= Зе 4 sint 


Example Solve ће equation 
y+oy=t 


The initial conditions are y = 0 and y = 1 att = 0. 
The transformed equation is 


1 
a = 
-1+6у = > 
5-у +6у zZ 
1 14:5? 
Theref у(52--6) = —41- 
erefore y(s^ +6) zt E 
Solving for y gives 
5241 5 1 


<1 


7-52(52--6) | 6(52--6) Ta 


From the table of inverse transforms at the end of this chapter we find 


1 5 1 1 5 
= + x — sin V6t = - t+ Sesin vir) 
77:67:16 6 1 V6 
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11.5 Solution of Simultaneous DEs with Constant Coefficients 


In physics and engineering, we frequently encounter systems which give rise to si- 
multaneous differential equations, e.g. an electrical network consisting of two loops, 
or a double spring—mass system. We will now illustrate their solution by means of 
the Laplace transform technique. 

If x(t) and y(t) are two functions of the independent variable ¢, their transforms 
are denoted by x(s) and y(s), respectively, or simply Х and y. 


Example Solve the equations 
3x+2x+y = 1 
х +47 +3у = 0 


The initial conditions are х = 0 and у = О at t = 0 
Transforming the equations gives 


бо | = 


3(sX — xo) -2X +s — yo = 
sX — хо - A(sy — yg) -3y = 0 


but xo = 0 and yo = 0 
Hence we obtain a pair of simultaneous equations in X and y: 


1 
K(3s+2)+ ys = т 
х= + 7(45 +3) = 0 


Solving for X gives 


: (48--3) [Ro 3 
x= = 
s(s+1)(1ls+6) 2s S(s+1) 10(s+6/11) 
Hence i 4 
21 14 D -6t/11 
Pug Be 20108 


Solving for y gives 


Е -1 1 1 1 

V= r1146 5 (A si 
Hence 
(e: —e76t/11) 


Ys 


Example Solve the differential equations 
X¥+2x-y=1 
xX+¥+2y = 0 
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The initial conditions are x = 1 and X =y=y=Oatt=0. 
Transforming the equations gives 


1 1 
(s2--2)&—sy = -+sx0o= -+s 
5 s 
5+ (s24-2)y = хо = 1 
Solving these two simultaneous algebraic equations in х апа ӯ we find 


З-Д 1 5 5 


ауа) 2 395/31» 60284 


Looking up the table of inverse transforms at the end of this chapter we have 


I ub coe codor 
= — = COS - COS 
MS Ed 6 


Also 


= 1 1 1 1 
dan (s?+1)(s2+4) 3 (= zi) 


and from the table of inverse transforms we get 


li t li 2t 
= -sni — — Sin 
pg 6 
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Appendix 


Table of Laplace Transforms 


f(t) or y(t) LISE) or F(s) 
A 
A БЕН 
5 
eat I 
5-а 
n n! 
t" (n = 0,1,2,3,...) SHE 
шэн w 
sin a 
532192 
5 
ot —— 
cos Sio 
tsinwt 205 
sin yy 
(s2 + @2)2 
s? — w? 
t coswt (240272 
: w 
sinh wt 3-3 
5 
cosh ot 27-02 
t sinh wt 2035 
sin > Or 
(s? — 92)? 
. w 
sinh cot 12-42 
cosh ot PED 
timor 2008 
sin t 
(82 —w2)2 
2 2 
S“+@ 
1 coshot CESE 
ea y(t) S(s +a) 
d” 2 
2210 CD c 3G) 
t = t 
>00 А y(s)ds, if lim 20) exists 
t Js 1—0 t 
sin@wt 109 
tan = 
t 5 
»(t) Sy — yo 
(t) 82У-8У0-У1 
Y(t) 53ў —s?yg —syi — ya 
а" р п-1 бог 4 
TAA ) s" y(s)— 20 i ae ) 
i= 
t А 
[уо »(s) 
JO 5 
kei? kei? 
2ke™ cos(wt + 0) hs : 
5-0-10 sina+jo 


Appendix 


Table of Inverse Laplace Transforms 


F (s) or y(s) £ '[f(s)-f)o'[l-»(t) 
A 
v A 
1 (7-1 
sv (n — 1)! 
1 eat 
5-а 
1 t^- legat 
(s—a)" (n — 1)! 
1 1 at bt 
(s —a)(s — b) TES да 
5 1 аі b 
moi 
(s —a)(s — b) pur RR) 
! ot 
—sin 
532192 
à cos wt 
32192 
1 dic. 
(s-a) +o? —e" sinof 
( T 5 ей! сов ct 
5-а)? +w 
1 
eee) —5(1-—cosot) 
: : (wt —sinwt) 
52(s2 + w2) 3 
! (sin@t — wt соѕ 01) 
WA LOU sin@t — wt cos 
524092)? 203 
5 [ox 
22 rel 5:5 0 0f 
x : (ѕіп 07 + ot cosot) 
от — (sin cos 
52 + w2)? 20 
i: 1 w2? (cos 017 — cos wt) 
52 40012)(52 +022) 0022 — @? 
1 1 
— sinhot 
Fo PL 
2 cosh ot 


333 
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Exercises 


11.3 Laplace Transform of Standard Functions 


1. Obtain the Laplace transforms for the following functions: 


(a) 1/41? (b) 5e?! 
(c) 4cos3t (d) sin?t 
2. Obtain the inverse transforms for the following: 
1 1 2 
SS b) ——— oS лэ 
(a) 4s2 +1 (5) 8(5--4) (© s(s2 +9) 
6 1 4 
(d —— ©) тат Рута 


11.4 Solution of Linear DEs with Constant Coefficients 
3. Solve the following differential equations: 


(a) f+5y+4y=0 (initial conditions: y = 0, y = 2 att = 0) 
(b) ¥+9y = 51021 (initial conditions: y = 1, = —1 att = 0) 
(c) y+2y=cost (initial conditions: y = 1 at t = 0) 


252 —3s 4-4 


4. If  — 3y --2y =4 and y —2, y =3 att = 0, show that y = ————————., 
s(s— 1)(s—2) 


and hence find the solution for y. 


5. Given У + y =e! +t + 1 with the initial conditions y = 0, y = 0, y = Oat 
t = 0, obtain y. 


11.5 Solution of Simultaneous DEs with Constant Coefficients 
6. Solve the following simultaneous equations for y: 
у7-2Х-у-хХ--25 
2p+x = 25e! 


Initial conditions: y = 0,x = 25 att = 0. 
7. Solve for y and x given 
4x—ytxr=1 
4x—4y—y-0 


Initial conditions: x = 0, у = O at t = 0 
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8. An electrical circuit consists of a capacitor, C farads, and an inductor, 
L henries, in series, to which a voltage E sin wt is applied. 
If Q is the charge on the capacitor in coulombs show that 


E w w 
L(@?—1/LC) |s2+1/LC 52 +0? |” 


Q= w LC #1 


and hence calculate Q given that C = 50 x 1076F, L = 0.1 H,w = 500rad/s, 
E=2V and Q = О = 0 att — 0. 


Chapter 12 


Functions of Several Variables; 
Partial Differentiation; and Total Differentiation 


12.1 Introduction 


So far we have dealt with functions of only a single variable, such as x,t etc. But 
functions of more than one variable also occur frequently in physics and engineer- 
ing. 


Example A voltage V is applied to a circuit having a resistance R, as shown in 
Fig. 12.1. 


Fig. 12.1 


What is the value of the current which flows in the circuit? 
According to Ohm's law, the current / depends on the resistance R and the ap- 
plied voltage V, i.e. 


1= = 
К 


Hence J = I (V, R), which is a function of two variables. 


Example A gas is trapped inside a cylinder of volume V. The gas pressure on 
the cylinder walls and the piston is p and the temperature is T (see Fig. 12.2). 
The following relationship between volume, pressure and temperature holds true 
for 1 mol (6.02 х 1023 gas molecules): 


pV = КТ 
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where R, the gas constant, is approximately 8.314J K~!mol~!. The equation can 


be rewritten as T 
== R — 
ян 


This means that the pressure is a function of two variables, i.e. p = p(V,T). 


Fig. 12.2 


12.2 Functions of Several Variables 


Let us now leave the physical examples and consider the mathematical concept. If 
z is a function of two variables, x and y, then the relationship is usually expressed 
in the form 


z= f(x,y) 
Remember that, geometrically, a function y of one variable x(y = f (x)) represents 
а curve in ће x— y plane, as shown in Fig. 12.3. 


y 


у= х) 


Fig. 12.3 


Similarly, a function z = f(x,y) of two independent variables x and у can be 
thought of as representing a surface in three-dimensional space. 

A geometrical picture of the function z = f(x,y) can be obtained in two differ- 
ent ways. 
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12.2.1 Representing the Surface by Establishing a Table of Z- Values 


By giving x and y a particular value 
we obtain a value for z by substitu- 
tion in z = f(x,y) 


This value is erected perpendicular 
to the x— y plane at P'(x, y) and it 
determines a point in three-dimen- 
sional space. 


The procedure is carried out sys- 
tematically for many pairs of values 1 
(x, y) in the x— y plane by tabulat- 
ing the values as shown in the fol- 
lowing example. 


79-70- 709-79 
A ГА 1 


Fig. 12.4 53 (о) 
Example Values for the function Table 12.1 
22 ! x 0 1 2 3 
1+x2+ y? y 
are given in Table 12.1. 0 1 1 1 1 
2 5 10 
1 1 1 1 1 
2 3 6 11 
1 1 1 1 
2 
5 6 9 14 
1 1 1 1 
3 


EN 
о 
= 
m 
EN 
RB 
= 
Ке) 
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By plotting each computed value of z and the corresponding pair of values (х, у), 
we obtain a picture in three-dimensional space (Fig. 12.5). 

The set of values of (x, y) for which the function z = f (x, y) is defined is called 
the domain. 


Fig. 12.5 


12.2.2 Representing the Surface by Establishing Intersecting 
Curves 


Let us return to the function z = 1/(1-- x? + у2). Its domain is the entire x— y plane. 
Two characteristics of the function can be established at a glance: 


1. For x = 0 and у = 0 the denominator 1 + x? + y? has its smallest value. Con- 
sequently the function z (the surface) has a maximum given by 


f(0,0)—1 
2. As x — оог y — œ the denominator grows beyond all bounds and the function 
z tends to zero. 


Of course, these two characteristics are not sufficient to sketch the surface. Gener- 
ally speaking, the shape of surfaces is more difficult to determine than that of curves. 
Nevertheless, we can obtain a true picture of the function if we proceed systemati- 
cally by dividing the task into parts. The basic idea is to investigate the influence of 
each variable separately on the shape of the surface by assuming that one of the two 
variables is constant. 

If we regard y as being constant (y = уо) and vary x, then we obtain z-values 
which depend only on one variable. 

For example, if we set y — 0 in the above function we have 


1 
Е 1+ x? 
This represents an intersecting curve between the surface 2 = f(x,y) and the x—z 
plane at y = 0 (Fig. 12.6). 
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2, 


x 


Fig. 12.6 


For an arbitrary value y = yo, we have 
z(x) 


This represents an intersecting curve between the surface z = f(x,y) and a plane 
parallel to the x — z plane shifted by an amount yo along the y-axis (Fig. 12.7). 


1 
т 1+ х2 + yo? 


< 


x 


Fig. 12.7 


Similarly, we obtain a second group of curves by setting x to a constant (x — xo). 
For example, if we set x — 0 we have 


1 
z(y) — 14 y2 
For an arbitrary value x — xo we have 
1 
20) = 1+ xg? + y? 


Both z(y) curves are shown in Fig. 12.8a. 
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x 


(a) 


(b) 


Fig. 12.8 


By plotting both types of curves in one diagram we obtain a better picture. In 
this case the graph shows a symmetrical *hill (Fig. 12.8b). (Note that the values in 
a given row or column of Table 12.1 are the values of an intersecting curve.) 

The sketch becomes clearer if we fill in lines of constant z-value. Mathematically, 
they are the curves of the surface which are at a constant distance from the x— y 
plane; they are the intersecting curves of the surface with planes parallel to the x— y 
plane at a given z-value (Fig. 12.8c). 
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12.2.3 Obtaining a Functional Expression for a Given Surface 


In the above discussion we started from a known function and looked for the result- 
ing surface. Now we reverse the process and look for a functional expression for 
a given surface. 

For example, consider a sphere of radius R with the origin of the coordinates at 
the center (Fig. 12.9). Our task is to determine the equation of the spherical surface 
above the x— y plane. 

Referring to the figure and applying Pythagoras’ theorem, we have 


R2=224e, (2 х? фу? 


Thus we obtain R? = x? + y? + z? 
Solving for z gives 


Z1,2 =+y Е? – х2 – у? 


The positive root 21 represents ће spherical shell above the x— y plane. 
The negative root 22 represents the spherical shell below the x— y plane. 


The domain is —-R<x<R 
and -К<у< р 
such that xu y? < R? 


Having acquired a pictorial idea of functions of two variables, z = f(x,y), we now 
give a formal definition. 


Definition z = f (x,y) is called a function of two independent variables if 


there exists one value of z for each paired value (x, y) within 
a particular domain. 


Fig. 12.9 
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By plotting points (x, y) and z = f(x,y) ina three-dimensional coordinate sys- 
tem we obtain a graph of the function which represents a surface F within the 
domain D of the variables (Fig. 12.10). 


Fig. 12.10 


It is not possible to represent a function of three variables geometrically since to 
do so we would need a four-dimensional coordinate system. 

However, in physics and engineering such relationships play a very important 
role. For example, we can express the temperature Т of the atmosphere as a function 
of three variables: the latitude x, the longitude y and the altitude (above sea level) 
z, ie. T = T (x, y,z). 


12.3 Partial Differentiation 


Remember that the geometrical meaning of the derivative of a function of one vari- 
able is the slope of the tangent to the curve y = f(x) (Fig. 12.11). 


Fig. 12.11 
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Cut parallel to x-z plane 


Cut parallel to y-z plane 


—__ N 
22222 == y 
х 


Fig. 12.12 


In the previous section we considered as an example of a function of two vari- 


ables the function i 


£009) = UR [1] 


It represents a surface in three-dimensional space. By setting one variable at 
a constant value we obtain an intersecting curve of the surface with a particu- 
lar plane. 

We can slice the surface with planes parallel to the x—z plane (Fig. 12.13a). If 
the intersecting plane is at a distance yo from the x—z plane, the equation of the 
resulting curve is obtained by substituting y = yg in [1], i.e. 


1 
= —  —Z 
1+x2+ yo? 


z(x) 


is now a function of x only. 


Fig. 12.13 
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We can also slice the surface with planes parallel to the y—z plane (Fig. 12.13b). 
If the intersecting plane is at a distance xo from the y —z plane, the equation of the re- 
sulting curve is obtained by substituting x = хо in [1], i.e. z(y) = 1/(1 + xo? + y?). 
z is now a function of y only. 

Let us now consider an intersecting curve of the first type (where y is constant). 
As it is a function of one variable, z = f(x), we can calculate the slope o at any 
given point (Fig. 12.14). 


x 


Fig. 12.14 


In order to distinguish between the ordinary derivative and this new deriva- 
tive, we use the symbol ð instead of d. It indicates that we are differentiating 
a function of more than one variable with respect to a particular variable only, 
regarding all other variables as constant. Thus, when differentiating the function 
z= f(x, y) = 1/1 +x? + y? where y is kept constant at some fixed value yo, we 
obtain 


[NC 1 8 2x 
дх Ox 0 Эх 1+х2+у027 — (1-х2--уд2)2 


Since this holds true for any value y — yo, we can write 


oz d д 1 2x 
== 5 (х,у) = 29:32] — 24 y2)2 
Ox dx ox 11--х2--у (1+х2+у2) 
This operation is called partial differentiation with respect to x. 
Similarly, we may obtain the slope of the second type of intersecting curves. It is 
given by the partial derivative of the function with respect to y. x is kept constant at 
some fixed value (Fig. 12.15). 
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Fig. 12.15 


Thus we obtain 


e =F y= (а) аза 
ду ду???) By \1 4x2 492) (xix yn 


Functions of three variables, f = f (x,y,z), are treated similarly, but it is not possi- 
ble to present them in geometrical form. There exist, of course, three partial deriva- 
tives. Table 12.2 contains a summary of rules and an example of each one. 


Table 12.2 
Partial derivative Rule Example: 
Ff (x, y,2) =2x3 y +z? 
У fe 9f 2 
Partial derivative with respect Treat all variables as constants y =6x-y (12.1a) 
except for x * 
to x:—— 
Ox 2f 
Partial derivative with respect Treat all variables as constants E" =2x3 (12.1b) 
except for y У 
to у:—— 
ду 1 
Partial derivative with respect Treat all variables as constants ын = 22 (12.1c) 
except for z 2 
to z:— 
Oz 


The partial derivative may be written in another way. Let f (x,y,z) be a function 
of the three variables x, y and z, then the partial derivatives may be abbreviated as 


follows: 
Of _ 9f 


ay ee ay 5s 3: 44 (12.2) 
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Example f(x,y,z) =xyz 


then h= = yz 
SO 

D gn d 
=e 

and а 


Example Obtain the partial derivatives of the function 
z= 5x? +2ху-у2+3х-2у+3 at x=1, y=-2 
The partial derivatives are 


д 
Еа 10х+2у +3 
Ox 


Oz 
Mar MEN 
ду АВНА 


Substituting the values x = 1, y = —2 gives 


Oz 
(3:).., 210x14-2(-2)43-9 
y=-2 


(5) =2x1-2(—2)-2=4 
ду х=1 
у--2 


Note that we have introduced the expression 


Oz 
Ox) х-1 
у--2 
It means that the partial derivative with respect to x of the function 2 is to be evalu- 
ated at x = 1 and у = —2. 


12.3.1 Higher Partial Derivatives 
The partial derivatives are themselves functions of the independent variables 
x,y,..., in general. We can, therefore, differentiate them partially again. 


Example Let 
x 
f(x,y,z) = ye 2 
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2222: 
ox \ ду 
1 06:27 
ду NOx // 


The first expression means that we differentiate the function f first with respect 
to y and then with respect to x. 


Evaluate 


and 


д (of E df 
ox \ðy дхду 
д 
= 3x ^ = Уух 
Therefore i 
Уух = y? 
Similarly 
д (of J^ д 
ду (52) дудх 3y ^ foy 
Therefore 1 
Љу = 52 


The order of partial differentiation is immaterial. 
For most functions encountered in physics and engineering the following 
holds true: 
[xy = fyx, ete. (12.3) 


Example For the function и = x?/y sinz show that the following mixed third 
derivatives are equal: 


Uxyz = Uzyx 


2X 4 2 
Их = —sinZz Uz = —COSZ 
y 
2x , x? 
Mxy = ——;Ssinz Uzy = ——5CO0SZ 
y y 


2x 2x 
Uxyz = — —у COSZ Uzyx = ——zCOSZ 
y y 


350 12 Functions of Several Variables; Partial Differentiation; and Total Differentiation 


12.4 Total Differential 


12.4.1 Total Differential of Functions 


A function z = f (x, y) represents a surface in space; on this surface there are lines 
at the same level, z = constant. If we drop perpendiculars from these lines on to the 
x— y plane we obtain their projections on this plane. These projections are called 
contour lines. They are extensively used in geographical maps. 

Algebraically, contour lines are obtained by setting the function z = f (x, y) = С 
(where C is a constant). When f (x, y) = 1/(1 4- x? - y?), we have 


1 
14+x2+y? 
This is an implicit representation of a curve in the x—y plane. 
In this case we obtain contour lines which are circular, as shown in Fig. 12.16. 
This can be proved as follows. Rearranging gives 
1 


2 2 
= — – 1 
хг +у С 


Remember that the equation of a circle of radius R in the x— y plane is x? + y? = 
R?. Hence, in this case, R = 4/ 1/C — 1. The larger we choose C the smaller is the 
radius of the circle. (But, of course, C must not exceed 1, and it must be positive.) 

Following these preliminary remarks we are now in a position to look for the 
direction of steepest rise or decrease of the surface at a given point: 


1 
Zi 
14x24 y? 


Fig. 12.16 
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Fig. 12.17 


It is clear from Fig. 12.17 that the surface decreases most steeply in a radial direc- 
tion. (Note that, for the sake of clarity, the surface has been redrawn to a differ- 
ent scale.) 

Let us look more closely at this figure. 

If we travel the same distance dr from point A’ in the x—y plane: 


(a) in an arbitrary direction dr, 
(b) perpendicularly to a contour line dr», 
(c) along a contour line dr3, 


— Oe 
then, on the given surface, this corresponds to the paths AC, AB, AD respectively. 

The path AD is along a contour line. Hence dz3 = 0 and the function does not 
change at all. 

In contrast, the function z changes most rapidly along the path AB which is in 
a direction perpendicular to the contour lines. 

We are interested in finding how much the function z = f(x,y) changes when 
we travel a distance dr in an arbitrary direction dr = (dx,dy). The total displace- 
ment, in vector notation, is 

dr = dxi +dy j 


In Chap. 5 we saw that for functions of one variable the differential is an approx- 
imation for the change of the function for a given Ax, i.e. Ay ~ d f /dx Ax. In the 
same way the total differential is an approximation for the change in the function 
for small changes in x and y. The change of f (x, y) is obtained in two steps: 


1. by proceeding in the x-direction through a distance dx, with y remaining con- 
stant; 

2. by proceeding in the y-direction through a distance dy, with x remaining con- 
stant. 
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Fig. 12.18 


Let us be more explicit. Regard Fig. 12.18 


Step 1: For the change in z in the x- Step 2: For the change in z in the y- 
direction, with y remaining direction, with x remaining 
constant, we have: constant, we have: 

dz E )dx dz zd )d 
(х) = dx m (у) = ду 22 У 


The total change in 2 is the sum of these partial changes. Thus 


д2 Oz 
dz = dz; +dzZ(y) = dx dx + Зу dy 


Definition Тһе total differential of a function z = f (x,y) is given by 


д д 
dz — СРЕ dy (12.4) 
Ox ду 
The total differential is an approximation of the true change 
Az in the function z as we proceed from a point (x, y) a short 


distance in the direction dr — (dx,dy) 


Az = dz 


Example The function z = x? + y? has a total differential 


dz = 2x dx + 2y dy 
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Example The total differential of the function f(x,y) = T3555 is 


2x d 2y 
X 
(lx? Eye) (1+x?+ у2)2 


df(x,y)= dy 


Example Calculate the values of Az (the true change in z) and dz (the approximate 
change in z) if z = 5x? +3y at the point (2, 3) with dx = 0.1 and dy = 0.05. 
The true change in z is given by 


Az = f (x +Ax,y+Ay)— f(x,y) 
Hence with Ax = dx = 0.1, Ay = dy = 0.05 we have 
Az = 5(2.1)? + 3(3.05) — (5x 27+3 x 3) = 2.20 
The approximate change is given by the total differential 
dz = 10x dx + 3dy = 10 x 2(0.1) + 3(0.05) = 2.15 
The difference between the true value Az and the total differential dz is small. Thus 
Az & dz 


In practice, if dx and dy are small then the approximation Az z dz is acceptable 
and commonly used in many of the problems physicists and engineers encounter. 


Extension to Functions of Three Independent Variables f(x,y,z) 


In the case of a function f (x,y,z) of three independent variables the total differen- 
tial is given by 
д д 
7 dy+ 2 42 (12.5) 
2 


df = 3x Ec 


zÀ 


x dy 
Fig. 12.19 
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As before, the total differential is a measure of the change in the function 
и = f (x,y,z). If we proceed by a small displacement along dr = (dx,dy,dz), 
as shown in Fig. 12.19, the function changes by an amount equal to the total differ- 
ential. 


Example The volume of a parallelepiped is given by V = f (x,y,z) = xyz, where 
x,y and z are the lengths of the three sides. The total differential dV is 


dV = yz dx + xz dy + xy dz 


12.4.2 Application: Small Tolerances 


We know for a function of a single variable y = f(x) that if x is subject to 
an increment or decrement Ax, then the change in y is approximately given by 
Ay ғ f'(x)Ax. 

In the preceding section this has been generalised to several variables. For the 
sake of concreteness, let us concern ourselves with the tolerances of finished prod- 
ucts due to the tolerances of their components. Thus the increment or decrement is 
determined by the tolerances ô of these components. (In this section we will use the 
symbol ô instead of A.) 

If we have a function of several variables, such as и = u(x, y,z), then the total 
tolerance би due to individual tolerances бх,бу, 82 is 

u u u 
ĝu x шин say + a 
It is assumed that the tolerances ёх, ёу and óz are small. In practice, this is the case 
in most situations. 

Since и is a linear function in ёх, dy and óz, it follows that the total tolerance is 
obtained by adding the effects due to each one separately. 

For example, consider a dimension of a link in a mechanism. It would be spec- 
ified by its length x and a manufacturing tolerance imposed on it of +éx. When 
the part has been made and a check on its dimension is carried out we would hope 
to find that its length will lie in the interval x — 6x < x < x + ôx as a result of the 
manufacturing process. 

If a device consists of a number of parts, it will be affected by the tolerances im- 
posed on those parts. If и, the output, is a function of n parts of 
lengths x;,i — 1,2,...,n, and tolerances 6x;, then the tolerance in the output will be 


e ди 
ĝu x 2 25; 


The individual tolerance 6x; may have either sign and usually it does not attain 
its maximum value. But, if we assume the worst, we must add the effects of all 
maximum individual tolerances in order to obtain the maximum possible tolerance. 
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The maximum tolerance is then given, approximately, by 


n 
биз EY A 


Example Figure 12.20 shows diagrammatically a link pivoted at O and connected 
to another link at A (not shown). The position of the link at A (its output) will have 
an influence on the position of the link to which it is connected. This position will 
then depend on the tolerances +ô} on its length and +46 on its angle relative to 
some datum, i.e. the x-axis in this instance. If / = 95.00mm and д/ = +0.10mm, 
Ө = 35.00? and 0 = +0.25°, calculate the maximum tolerance in y and compare 
the result with its true maximum value. 


Fig. 12.20 


As a result of the manufacturing process, we know that / — 81 < 1 < l + ôl and 
0 —60 « 0 < 0 4-60. It follows, therefore, that A will lie somewhere inside the 
boundaries BCB'C'. 

Now let us calculate this maximum tolerance using the total differential ap- 
proach. 


у= (1,0) = Isin0 


Непсе 
9f, af 
óy = 3r? + 30°? 
of J. 
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Substituting in the above equation gives 


dy = sin0 (+81) + l cos 0(+60) 


0.25 
= si Э +0.1 9 2 prm IS 
sin35*( )--95с0835 ( 23) 


2465 2.0.25 
E ((sinss )0.1 + (95cos35 18) 


+0.3969 + +0.40тт to 2d.p. 


(Note that the factor 1/57.3 = 2x /360 is necessary to convert ће 66 value to radi- 
ans.) 

Now let us calculate the true maximum and minimum value of y. Considering 
the y position, its maximum value will correspond to B and its minimum value to B’. 
It is easy to calculate these values: 


Утах = (1 + 61) sin(0 + 50) 
= 95.1sin35.25 = 54.8865 = 54.89mm to 2d.p. 
and Ymin = 94.9811 34.75 = 54.0927 = 54.09mm to 2d.p. 


The exact or nominal value of y, ignoring tolerances, is 
y = lsin = 95sin35 = 54.4898 = 54.49mm 
Hence the maximum tolerances in y are 


Ymax — y = Sy = 54.89 — 54.49 = 0.4mm 
and min — Y = Sy = 54.49— 54.09 = 0.4mm 
i.e. dy = +0.4mm 


Both methods give the same result. 

This was a very simple case in which we could visualise easily where the output 
position of the links was likely to be, but in practice the problems encountered are 
much more involved and visualisation can be almost impossible. 

An example of this can be found in precision mechanisms where the output de- 
pends on the accuracy of a number of links, cams and gears. In such cases, the 
influence of tolerances can only be calculated using the total differential approach. 


12.4.3 Gradient 


In Sect. 12.4.1 the total differential of a function z = f(x,y) was defined as 


_of 97 
age aly 
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It is possible to regard the total differential as a scalar product of two vectors: 


first vector: dr = хі +dy j (called the path element) 
д 
second vector: grad f = 2 + 2 (called the gradient of f) 


It is easy to verify by inserting these vectors 


dz — dr grad f 


СОЛО РА 
Ox’ ду 
is called the gradient of a function z = f(x,y). 


grad f (х,у) = (52. 23 (12.6) 


Definition The vector 


The gradient has two properties: 


1. The gradient is a vector normal to the contour lines. Thus it points in the direc- 
tion of the greatest change in z. 

2. The absolute value of the gradient is proportional to the change in z per unit of 
length in its direction. 


In order to explain these properties, we will consider the scalar product 
grad f dr =dz 


If dr coincides with a contour line we obtain dz — 0, since a contour line is the 
projection of a line of constant z-value on to the x— y plane. Thus, in this case 


grad f dr =0 


We know from Chap. 2 that the scalar product of two non-zero vectors vanishes 

if and only if they are perpendicular to each other. Thus it follows that grad f is 

perpendicular to the contour lines, i.e. grad f is a vector normal to the contour lines. 
Let us illustrate this with the example used earlier: 


1 
0) = туз 


Remember that the contour lines are circles. 
The gradient of the given function is 


—2x —2y 
(CELEIA Балин 


grad f = ( 
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This vector always points in the direction of the origin of the coordinate system. 
Thus it is perpendicular to circles around the origin (Fig. 12.21). 


Fig. 12.21 


The absolute value of the gradient is a measure of the change of z. This follows 
directly from the equation 
dz =dr grad f 


For the particular case when dr is normal to a contour line, the equation can be 
rearranged: 
dz 
йн 


Thus the change in dz per unit of length in the direction of the gradient is given Бу 
the absolute value of the gradient. 

The concept of a gradient can be extended to functions of more than two vari- 
ables. In the case of a function of three independent variables, f (x,y,z), constant 
function values are represented by surfaces in three-dimensional space. The gradient 
is normal to these surfaces: 


| гай f | 


grad f (x,y,z) = GEE) 


Its magnitude gives the change in the value of the function in the direction of the 
gradient. 


12.5 Total Derivative 


12.5.1 Explicit Functions 


Up to now we have assumed that x and y are independent variables. It may happen 
that x and y are both functions of one independent variable f. If this is the case, z = 
f (x, y) is, in fact, a function of the single independent variable /. z will, therefore, 
have a derivative with respect to f. 
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Let x = g(t) and у = A(t); it is assumed that the functions can be differenti- 
ated. If £ is given a small increment Af, then x, y and z will have corresponding 
increments Ax,A y апа Az. The change or increment in the function z is given by 
"mol Of 


Ax + =—Ay 


DOS S ду 


Dividing by At we have 
Az of Ax of Ay 
АМ Ox At Oy At 


We can now proceed to the limit At — 0. The result is stated below. 


dz /dt is called the total derivative or total differential coefficient. 


dz „9f dx OF Ay 
dí ox dt dy dt 


Similarly if и = u(x, y, z) where x, y and z are functions of t, we obtain the 
total derivative of и with respect to 1: 


du | Oudx , dudy дийг 
dí дхй дуа 92:24 


(127) 


Note that we write z, not 92, since z is a function of a single variable. 
This concept can obviously be generalised for any number of variables. 
t n 
Example Letz = f(x,y)=x+yandx=5, у= % 
Obtain the derivative gz, 
Firstly we need the partial derivatives: 


Oz Oz 
a cb ag 


Secondly we need the derivatives of x and y with respect to 1: 


Hence the total derivative is 
dz 


e! 
d 2 2 


Note that z is familiar: z(t) = cosht, and the derivative with respect to f is 
dz/dt = sinh f. 
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12.5.2 Implicit Functions 


Remember that the function f(x,y) = C is an implicit function y of one single 
variable x in the x— y plane (contour line). We wish to obtain the derivative d y /dx 
without explicitly solving for y. In Chap. 5, Sect. 5.9.1, we showed how this could 
be done. Using the concept of the total derivative, we will now obtain a general 
expression. 

The geometrical meaning of a total derivative of the function z = f (x,y) is that 
it gives the total change in z. 

The total derivative with respect to x is 


dz ofdx ofdy 
dx дхах ody dx 


But since z is constant for a contour line (i.e. dz = 0), and dx /dx = 1, we obtain 


ОЕ 
| дх ay dx 
1 dy . dy | 9//дх 
Solving for dx gives di 9718) (12.8) 


Example If x? — y? +4xy = 0, 
calculate the value of the derivative dy/dx at x = 2, y = —2. 
Let = x? — y? +4ху 


д д 
Then С = 3x? +4y and ar = 23у? +4x 
Ox ду 
2 
Therefore dy = SU 
dx —3y? + 4x 
Hence the value of the derivative at x = 2, y = —2 is 
dy. X2 44()... es 
dx -3(-2)2+4х2 -4 


The equation f (x,y,z) = 0 can be considered as defining z as an implicit function 
of two variables x and y. We are interested in finding expressions for the partial 
derivatives of z, i.e. dz /ðx and дг/ду. 

We first concentrate our attention on д2/дх. 

We know that 


0- f(x,y,z) 


The total derivative of this expression with respect to x gives 


ERE DEC CPC 


: Ox ду дх Oz Ox 
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Remember that forming the partial derivative with respect to x implies regarding y 
as constant. Thus Оу/дх = 0, and solving for д2 /9х gives 


92 Ш | of /dx 

ox ӘЈ/д= 
Similarly, we obtain 

az __Əf/ðy 

dy д//д2 


Example Given х2/25--у2/15--22/9--1, calculate the partial derivatives of z. 


Of 2x OF 2y ӨД 2z 
ðx 25 ..0y 15? əz 9 


Substituting in the equations for the partial derivatives we find 


Oz 2x/25" и Ux Oz 2y/15 Oy 


Ox 22/9 2527 cDy-- —22/0  15z 


12.6 Maxima and Minima of Functions 
of Two or More Variables 


In Chap. 5 we derived the necessary conditions for a function of a single variable to 
have a maximum or a minimum. We now consider the conditions for maximum and 
minimum values in the case of functions of several independent variables. 

A function of two variables, 2 = f (x,y) (see Fig. 12.22), is said to have a max- 
imum at the point (xo, yo) if 


Af = f(xo +h, yo +k) — f (x0, yo) < 0 


for all sufficiently small values of / and k, positive or negative. 
The function will have a minimum if 


Af = f (xo +h, yo+k) — f(xo.yo) > 0 


From a geometrical standpoint, this means that when the point (хо, уо,20) on 
the surface z = f(x,y) is higher than any other point in its neighbourhood then 
(хо, Yo, Zo) is a maximum, but if the point is lower than any other neighbouring 
point on the surface it is a minimum. 

At a maximum or a minimum, the tangent plane to the surface is parallel to the 
x—y plane. This condition will be satisfied if 


fx(xo; yo) =0 and fy(x0, yo) =0 (12.9) 


The condition is necessary but not sufficient, as the following considerations show. 
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Maximum (xo, yo) 
z=f(x,y} 


Minimum 


| 
] 
l 
1 
I 
і 
і 
| 


(xo, yo) i 
t 
x e Hb th, Yo + А) 


Fig. 12.22 


Consider, for example, two adjacent hills (Fig. 12.23). If we go from the top of 
one hill to the other one (path 1) there is a minimum. If we go through the passage 
between the hills (path 2) there is a maximum. Even though the condition fy = 
fy = 0 1 satisfied at P, P is a saddle point and not an extreme point. 


x 


Fig. 12.23 


We will now state a sufficient condition. The following comments do not con- 
stitute a complete mathematical proof: they are only included as a hint for diligent 
readers. The expansion of the function with respect to h апа k is 


Af = f (xo +h, yo +k) — f(x0, yo) 
1 
= (hfe + Ју) хохо + xy (h f + 2hk fry +k? fyy)xoyo +" 
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The first term of the expansion is zero at a maximum or a minimum, since fy = 0, 

Ју = 0. For a maximum the expression must be negative, independently of h and k. 

For a minimum it must be positive, independently of h and k. Thus the sign of the 

second term determines whether there is a maximum, a minimum or a saddle point. 
Taking k? outside the brackets we have 


ХоУо 


2 2 
Af-t. Ө R42 do fs 


The expression in the brackets is a quadratic function in h/k. 

From Chap. 3, Sect. 3.5, we know that the graph of a quadratic function — 
a parabola – lies entirely above or below the horizontal axis only if this function 
has no real roots (Fig. 12.24). 


Fig. 12.24 


This means that the radicand must be negative in this case. Therefore, the sup- 
plementary criterion for the extreme value we were seeking is 


(fry)? — fxx fry <0 


Furthermore, there will be a maximum if 


fxx(xo,yo) «0 and fyy(x0,¥0) « O (12.102) 


and there will be a minimum if 


Љх(хо, уо) » 0 and /fyy(xo.yo) > 0 (12.10b) 
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This can be memorised by recalling the corresponding criteria for a function of one 
variable. 
There remains one other case to consider: 
[fey (xo, yo)? — fxx (xo. yo) fyy (xo. yo) = 0 


However, we shall not enlarge on this case; generally speaking, one needs fur- 
ther information to decide whether or not the point (xo, yo) corresponds to an ex- 
treme value. 

Let us recapitulate the results obtained. 


The necessary condition for the existence of an extreme value at xo yo is 


fx(xo, yo) = fy (Xo; yo) —0 


This condition becomes sufficient if it is supplemented by the following con- 
dition: 
[fey (хо, Yo)? — fax (xo: Yo) fry (x0; yo) < 0 


Maximum if fxx < 0 and fyy <0. 
Minimum if fxx > 0 and fyy > 0. 


Example Calculate the extreme values of the function f = 6xy — x? — y?. 
1. Apply the necessary condition for a horizontal tangent plane. 
fr =6y—3x7 =0, fy =6x—3y?=0 
Solving the two equations in x and y, we find that 


(a) x = 0, у = 0 15 one solution, i.e. the point (0, 0) 
(b) x = 2, y = 2 is the other solution, i.e. the point (2, 2). 


2. Apply the sufficient condition for a maximum or a minimum. 


Рх = —6x, Љу = 6, Љу = —6y 


The condition is 
Tg — fxx fyy = 36 — 36xy « 0 


We must check whether this condition is satisfied at the two points under con- 
sideration. 
Inserting x = 0 and y = 0 gives 36— 0 > 0 

Therefore there is no extreme value at the point (0, 0). 

Inserting x — 2 and y — 2 gives 


36—36x2x2«0 


In this case the sufficient condition is fulfilled: there is a minimum or a maxi- 
mum at the point (2, 2). 
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3. Check the type of the extreme value. 
Since fy, = —6x = —12 and Ју = —6y = —12 there is a maximum at the 
point (2, 2). Its value is fmax = 8. 


Example in engineering A container of 10m? capacity with an open top is to be 
made from thin sheet metal. Calculate what the dimensions of the sides and the 
height must be for a minimum amount of metal to be used. What is the saving 
compared with a container of equal sides? 

If V is the volume, A the surface area of metal and x, y and z the lengths of the 
container, we have 


V=xyz, A=2xz+2zy+xy (where z is the height) 
We need to eliminate one of the variables, z say; thus z = V /(xy). Substituting in 
the equation for the surface area A, we obtain 
2V 2V 
A= — + — +xy 
y x 
ie. A= f(x,y) is a function of two independent variables. 


1. Apply the necessary condition for an extreme value; calculate the partial deriva- 
tives and set them equal to zero: 


дА 2V x 0 дА 21 РЭ 0 

--- = — —— b pm — = — — = 

ду y2 ' Ox x15 
Solving the two equations gives 


х= у= #20 = V20 =2.714m 


Obviously, this makes sense only for a minimum of the area. But let us use the 
formal procedure to verify this common-sense judgement. 
2. Apply the sufficient condition for an extreme value: 


дА 4V A дА 4V 


ду? y?’ axdy 7 gx x) 
With x = y = V27 it follows that 
(Axy)? Age Ayy =-3 <0 


3. Axx and Ayy are positive; hence we have a minimum. 
The numerical result is given by 


aes ae V2.5 x 1.357m 
xy 4 


The amount of metal required = 4.2 x 714 х 1.357 + 2.7147. Hence A = 16.58 m?. 
If we made x = y =z = a, say, then a = 10 = 2.15 ш, and the amount of metal 
would be A, = 5a? = 5.2 х 152 = 23.11 m?. The saving is Ay — A = 6.53 т2. 
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Example in engineering A trough, 12 m long, is to be made out of a steel sheet 
1.65 m wide by bending it into the shape ABCD, as shown in Fig. 12.25. Calculate 
the lengths x of the sides and the angle 0 if the sectional area is to be a maximum. 


9 1= 1.65 т 


В 1— 2х C 
Fig. 12.25 


We first observe that the length of the trough is not relevant, as we are concerned 
only with the cross section. The relevant variables are the length x and the angle 0. 
Let us denote the cross section by A: 


1 1 
A= 3 (AD + BC) х (vertical depth) = 20 -2х--2хсо80-41-2х)х8ш0 


= [xsin0 — 2x? sin0 + x? sin 0 cos 0 


A = f (x,0)is a function of two independent variables . 


1. Calculate the partial derivatives and equate them to zero: 


24. =I sind — 4xsind + 2xsin8 cos =0 
x 
дА 2 7 PT, i2 
gg ^ 988 — 2x cos + x^ (cos 6 —sin^0) = 0 


Solve the two equations in x and 0: 

(a) By inspection, we find a trivial solution x = 0,sin@ = 0; but this has no 
physical meaning. 

(b) Assuming x Z 0 and sin0 Z 0, we divide the first equation by sin 0 and the 
second equation by x, so that 


| = 4х — 2xcos0 
l cos0 = 2xcos 6 — x(2cos? 0 — 1) 
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By combining these equations we obtain 


2g. 
р E cc 
X cos 0 


We solve for cos and х: 

2cos 6 = 1, i.e. соѕ0 = 1/20r0 = x /3(60°), and //x 23, i.e. x = 1/3 = 0.55 m. 

Steps 2 and 3, which are necessary to prove that this is in fact a maximum, are 
left to the reader. 


12.7 Applications: Wave Function and Wave Equation 
12.7.1 Wave Function 


Let us consider the function of two variables: z = f(x,y) = sin(x — y) It can be 
represented by a surface in space. 

To gain a proper understanding of this function, we first draw the intersecting 
curves with planes parallel to the x—z plane for the values 


3 
y=0, у= 2, у=л, у= : y —2m (see Fig. 12.26) 


2 x 
In the x-direction, the intersection curves are sine functions with a period of 27r. 
We may see already that the surface is a series of parallel hills and valleys. The 


direction of the hills and valleys is an angle of 45? to the x- and the y-axes. 


T am 
y-0 Ae. VON very арт 
2 

ae 

Г, /7 
4, 
v / б. V 

LF fy 


Fig. 12.26 


We now draw the intersecting curves with planes parallel to the y—z plane (see 
Fig. 12.27). 
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Fig. 12.27 


We obtain sine functions with the same period. Again we see that the surface is 
a series of parallel hills and valleys. 

The maxima and minima lie on parallel lines making an angle л /4 with the x- 
and y-axes. These are given by 


1 
x-y= (m + z) л for maxima 
1 зэл : : 
x—y = | 2n— 2 л for minima, where n is an integer 


Now let us examine the special case when one variable, say y, represents time. If 
we consider the value of time without its dimension, we have 


f (x,t) = sin(x — t) 


If we sketch the graph of this function we get the same picture, a series of parallel 
hills and valleys. The only difference is that f (x,t) of course is now a function of 
position and time. In practice, we often encounter this type of function f (x,t) as 
a function of x, the value of which changes with time. What is observed at a given 
time fo is the intersecting curve of a plane parallel to the x—z plane at the point 
t = to with the surface z = f (x,t) (Fig. 12.28). 

As time progresses, f increases and the intersecting curve changes. In this case, 
this results in a sine function travelling in the x-direction. We obtain a function 
which behaves like a wave on a cable. This type of function is called a wave function. 
The period in both the x-direction and the t-direction is 27 in our example. 

The length of one period in the x-direction is called the wavelength. If we want 
to describe a wave function with an arbitrary wavelength A, we must set 


f (x,t) =sin (275 D 
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f(x, t) = г 


—— ————» «xz plane moving with time t 


Fig. 12.28 


The period, usually denoted by Т, is the time interval of one oscillation at a given 
point xo. The inverse value is called the frequency v, i.e. the given function sin 
(x — t) has the frequency v — I. 

If we wish to describe an arbitrary frequency v — T we must write 


f (x,t) = sin (263 - T) 


We have now obtained a general formula for the one-dimensional wave function. It 
must be noted that now we can reintroduce the dimensions of the physical quantities 
position and time. The argument of the sine function remains dimensionless. 

The value of 27tv is also referred to as the circular frequency о. 

From another aspect, it can be observed that during the time T = 1 = 2л the 
wave travels one wavelength A. 

The velocity v of the wave is called the phase velocity: 


==, ie v=a 
о= т, be v v 
Using the circular frequency, the phase velocity can be expressed 


(0) 


SUE 


370 12 Functions of Several Variables; Partial Differentiation; and Total Differentiation 


Particular positions of the wave such as maxima, minima or zeros travel in the x- 
direction with this velocity. 

Generally, the one-dimensional harmonic wave function is written in two equiv- 
alent forms: 


f (x,t) = Asin ЕЕ 


= Asin (Fe- vt) +) 


The argument of ће wave function is called the phase. фо is ће phase at t = 0 
and x = 0. A is the amplitude. It may be the physical displacement of a point, an 
electrical quantity, an air pressure (sound waves), a distortion etc. A may be a scalar 
or A may be a vector, like a displacement r or an electrical field vector E. 

Figure 12.29 shows the usual graphical representation of the wave function. Fig- 
ure 12.29a gives the wave for a fixed time fo; Fig. 12.29b gives the oscillation of 
a fixed point xo. 


(a) 


| —— >. —— 


(b) 


Fig. 12.29 
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A harmonic wave propagating in the opposite direction is given by 


f (x,t) = Asin (Fs fore) 


2л 


or f (x,t) = Asin БОХЛО 


Spherical Waves 


In physics we frequently encounter waves which travel in all directions from the 
origin of a point source. The wavefronts of such waves are concentric spheres with 
the source as center. The separation of two adjacent wavefronts having the same 
phase is equal to one wavelength. Electromagnetic and acoustic waves are often 
spherical waves. 

In the case of spherical waves, we have to take into account the fact that their 
amplitudes decrease with the distance from the source; for instance, this is the case 
for sound waves. 

In the case of an acoustic wave, the amplitude of the air pressure p is given by 


the following function: 
Po) . 2л 
== —r—ot 
р (5 sin (22 or +60) 


where p is the pressure difference compared with the air pressure of the air at rest 
and r is the distance from the center of a harmonic sound source. 


12.7.2 Wave Equation 


Wave functions are solutions of differential equations of the form 


9? f (x,t) ши! 92 f (x,t) 
ox? e te 
The RHS is the second derivative with respect to time, and the LHS is the second 
derivative with respect to displacement. 

Equations in which differential coefficients appear are called differential equa- 
tions (Chap. 10). In the above case, since the differential coefficients are partial 
ones, the equation is referred to as a partial differential equation. 

The equation is known as the one-dimensional wave equation with a velocity of 
propagation c. 

Waves on the surface of a liquid, or on a stretched membrane, lead to a wave 
equation being two-dimensional for the function f (х, y, t): 


дх2 9y? c? ðt? 


3*3. ok od gay 
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In the case of sound and electromagnetic waves we find the wave equation to be 
three-dimensional for the function f (x, y,z,t): 


2g qom OD 


Ox2 ду? 922 c? ət? 
Thus the wave equation occurs in many different fields of physics and engineering. 
J. Maxwell showed the relationship between electric and magnetic fields by means 
of the wave equation, and this started the search for electromagnetic waves. H. Hertz 
proved their existence experimentally in 1888. 

The general solution of partial differential equations is a subtle problem in math- 
ematics. There is no general method like, for instance, the exponential solution for 
ordinary differential equations (see Chap. 10). 

Starting with the general solution of ordinary differential equations, we were able 
to obtain particular solutions from the statement of the boundary conditions. Partial 
differential equations have no general solutions, only particular ones; consequently, 
the boundary conditions have a marked influence on their solutions. 

The wave equation has a great number of solutions. Which one is chosen depends 
on the boundary conditions of the problem. 

We will concern ourselves with the one-dimensional case and show that any func- 
tion of the following form is a solution of the one-dimensional wave equation. Thus 
и can be any function which is differentiable twice with respect to x and t: 


f (x,t) = u(x — ct) 


D’Alembert’s solution 


Р’ Alembert noticed that x and ct have the dimensions of length and he therefore 
introduced two new independent variables: 

p=x-ct 

q=x+ct 


He then reduced the wave equation to a form which can readily be integrated. It can 
be shown that by these substitutions the wave equation becomes 


827 = 
дрда | 
d fofy 
í a (az) = 
Integrating once, we obtain 
Of | 


a^ Q (q) , an arbitrary function of q опу. 
q 
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Integrating once again gives 


f= | 004) + Pt) 


where Р(р) is an arbitrary function of р only. 
If we let f Q(q) dq = G(q) the solution becomes 


f = P(p)+G(q) 


Substituting for p and q, we obtain 
f = P(x—ct)4- G(x 4 ct) 


This is d' Alembert's solution of the wave equation. P and G are arbitrary functions. 

This result implies that, physically, the wave equation is satisfied for waves trav- 
elling in opposite directions. It gives rise to stationary waves. A stationary wave can 
be produced by the superposition of two harmonic waves of equal frequency trav- 
elling in opposite directions. It can also be produced by a progressive wave being 
reflected at a boundary, provided that the conditions are suitable. Examples are the 
vibrations in a pipe and the vibrations of a string fixed at each end. 

In this book we will deal no further with partial differential equations. Further 
examination belongs to more advanced texts. 


Exercises 


1. Construct a table of values for the function f(x,y) = х2у +6 where 
x = —2,—1,0,1 and y = —2,—1,0,1,2. 
2. What surfaces are represented by the following functions? Sketch them! 


(а) 1--х-2у-2 


(b) z 2x? y? 
2 2 
x y 
== Па 
4 4 9 


12.3 Partial Differentiation 


3. Obtain the partial derivatives of 


(а) f(x,y) = sin x +cos y (D f(x,y) =x?/1—y? 
(с) f(x,y) Sec ^» (d) f(x,y,z) = хуг хун 
(е) f(x,y,z) =e*Iny+z4 (f) f(x,y) = esinx + есо5(+У) 


4. Determine the slope of ће tangent in the x- and y-directions to the surface 
z = x? + y? at the point P = (0,1). 
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5. Determine the partial derivatives /хх, fry, fyx and fyy of the function 


z= R? x? у? 


2 


6. Show that the function z = 607757 satisfies the relation x f, + yfy = 0. 


12.4 Total Differentiation 


7. Determine the total differential of the functions 
(a) 2-24/1-132--»у2 
(b) z= x? +y? 

(с) f(x,y,z)= (24224627) 

8. A container in the form of an inverted right circular cone has a radius of 
1.75 m and a height of 4m. The radius is subject to a tolerance of 50mm 
and the height to a tolerance of 75 mm. 

(a) Calculate the total percentage tolerance in the volume. 
(b) What is the total percentage tolerance in the surface area of the con- 
tainer? 

9. Find the contour lines and calculate the gradient for the following functions: 


(а) f(x,y) =—x—-2y+2 


x2 


i ae aes 

(b) eae 
| 10 

(c) PUNS Ty 


10. Find the surfaces of constant functional values and calculate the gradient. 


() /(х,у,2)-х4у--32 
(b) (x,y,z) = x* 4 y? 
(©) fG yz) = (x+y? e y 


12.5 Total Derivative 
11. Obtain du/dt when 
(а) u = x? — 3xy + 2y? and x = cost, y = sint 
(b u2x-4J/xy -3y,x =t?,y=1/t 
12. (а) u = x? + y?,y =ax+b 
Obtain gu 
(b) x? — y? +4ху= 0 
Obtain ay atx=2,y=—2 


(c) xy +siny = 2 
Obtain ay atx —4,y = л/2 
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12.6 Maxima and Minima 


13. Examine the following functions for maxima and minima: 

(а) f(x,y) =x? —3xy+y? 

(b) f(x,y) = 12x +6y - x? - xy- у? 

(c) Application in engineering: A conduit along a wall is to have a cross- 
section (as shown in Fig. 12.30) made by bending a sheet of metal of 
width 0.75 m and length 5.5 m along the line ABCD. Calculate Л,/ and 
0 for maximum cross-sectional area. 


Em 
i—i 


Fig. 12.30 


12.7 Wave Function and Wave Equation 


14. Two cables considered as being infinitely long are excited at the left-hand 
end with an amplitude A and a frequency f. Write down the wave func- 
tion for 

cable (a) A=0.5m, f =5Hz, A=1.2m 
cable (b) A=0.2m, f=0.8Hz, A=4.0m 


15. Verify that the function f (x,t) = e"! -x)” satisfies the wave equation 


М е А 
922 дх2 


Chapter 13 
Multiple Integrals; Coordinate Systems 


13.1 Multiple Integrals 


Let us develop the problem by a simple example. 
A solid cube, as shown in Fig. 13.1, has a volume V. If the density р is constant 
throughout the entire volume then the mass is given by 


M = pV 


There are cases, however, in which the density p is not constant throughout the 
volume. The density of the Earth is greater near the center than at the surface. The 
density of the atmosphere is at a maximum at the surface of the Earth: it decreases 
exponentially with the altitude. 

Let us assume that the variation in the density is determined empirically and ex- 
ists in a three-dimensional table of values or in the form of an equation as a function 
of position, i.e. p = р(х, y,z). 


Fig. 13.1 


K. Weltner, W. J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
ISBN 978-3-642-00172-7 © Springer 2009 
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To obtain an approximation for the mass when the density varies we proceed as 
follows. The volume is divided into N cells, the volume of the ith cell being 


AV; == Ax; Ay; А; 
If the density at the point P; (xi, уг,2;) is p the mass of the cell is 
АМ; ғ p(xi , Yi, Zi Ax; Ay; AZi 


The mass of the entire cube is obtained, approximately, by adding up the masses of 
all the cells. Hence 


N N 
M = YAM; = У, p(xi, Yi, zi)xi NyiAzi 


Now let the size of the cells be taken smaller and smaller, so that N tends to infinity. 
In this way we get closer and closer to the exact value and can write 


N 
Эш У, р(х, уг,2:)Ах;АугАг; 
ete | 


When we were dealing with a function of only one variable, such a limit was called 
an integral. We now extend this concept to the above sum. In the limiting process, 
the differences Ax; , Ay; and Az; become differentials dx,dy,dz; and to express our 
limiting process we use three integrals, one for each variable, and write 


м = ||| o(x.y,2) dx dy dz 
V 


In words, we describe this as the integral of the function p over the volume V. Such 
an integral is also referred to as a multiple integral; the special case of three variables 
is called a triple integral or a space integral. To solve for M we have to carry out 
three integrations, taking each variable in turn and paying attention to the limits of 
integration. 

There are two cases to consider: 


(a) Multiple integrals with constant limits 
All limits of унэн are constant. 


Example 12 ae es p(x, y,z) dx dy dz 
=-л 


(b) Multiple integrals with variable limits 
Not all the limits of integration are constant. 


Example L 1 p» p(x, y,z) dx dy dz 
=-л 


The analytical evaluation of multiple integrals is discussed in the following sections. 
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Many multiple integrals can be solved analytically. There are, however, cases 
which lead to very complex expressions or which cannot be solved at all. In such 
cases, the values of multiple integrals can be computed approximately by means of 
sums which are sufficiently exact for practical purposes if the subdivision is fine 
enough. 


13.2 Multiple Integrals with Constant Limits 


The actual execution of a multiple integration is particularly easy if all the limits 
of integration are constant. It is thus reduced to the repeated integration of simple, 
definite integrals. In our example of a solid cube, the computation of the mass of the 
cube is obtained by integrating throughout the entire volume (cf. Fig. 13.1), i.e. 


along the x-axis from O to a 
along the y-axis from O to b 


along the z-axis from O to c 


The triple integral sign denotes the following operations: 


Да Г, Im Bin ore) dx dy de 


Linnet Integral 
Second Integral 


Outer Integral 


Step 1: Obtain the inner integral. y and z are regarded as constants; the only vari- 
able is x. 

Step 2: The result of the first integration is a function of the variables y and z. We 
now solve the second integral assuming z to be constant by integrating with 
respect to y. 

Step 3: Finally, we are left with a function of z alone and the outer integral is ob- 
tained. 


Note that in the case of constant limits the order of integration can be changed, 
provided the integrand is continuous. 

Multiple integrals are referred to as simple, double, triple, quadruple etc., de- 
pending on how many integrations are to be performed. 
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Example Calculate the mass of the rectangular prism shown in Fig. 13.2 of base a, 
b and height h. 


x 


Fig. 13.2 


The density decreases exponentially with height according to the relationship 


02 


р = poe 


This example is of practical interest in the calculation of ће mass of а rectangu- 
lar column of air above the Earth’s surface. Due to gravity, the density decreases 
exponentially with increasing altitude. 


po is the density at z = 0 


The constant o has the form о = po/ Po g, where g = acceleration due to gravity 
and Ро = barometric pressure at z = 0. 
The mass of the column of air is calculated by the following multiple integral: 


h pb га 
M -| T | poe ^? dx dy dz 
o Jo Jo 


Evaluation of the inner integral gives 


h b h b 
M= | y poe ^" [x]g dy dz = «| | poe ^7 dy dz 
о Jo о Jo 


Evaluation of the second integral gives 


h h 
M= «| рое [y]8 dz = ab | poe ^^ dz 
0 0 
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Evaluation of the outer integral gives 


e abpo 


200 (1 — ea) 
0 


1 
M= SE dd 
abpo ES | 


Figure 13.3 shows that the mass of the column of air approaches a limiting value M... 


Fig. 13.3 


13.2.1 Decomposition of a Multiple Integral 
into a Product of Integrals 


There are cases in which the integrand can be expressed as the product of functions, 
each in terms of a single variable. 


f(x,y,z) = g(x)h(y)m(z) 


Hence the multiple integral is the product of simple integrals: 


de. P fosa dx dy ас- [ a(x) ах | n9) ay f mt) dz 


Example Evaluate / — Та fu. sin x cos y dx dy. 
The integrand is the product of two independent functions. Hence 


Sm i mA, . 
I zi sin x ax f cos y dy —[-cosx]g' [siny]g =0 
0 0 
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13.3 Multiple Integrals with Variable Limits 


Multiple integrals with constant limits of integration are a special case. Generally, 
the limits of integration are variable. Now we will consider the general case of vari- 
able limits. 

We will demonstrate the procedure with an example of the calculation of the area 
shown shaded in Fig. 13.4. 


The area is obtained by summing all the small areas or meshes, such as AA = 
AxAy, within the boundaries so that 


N 
A ғ Y Axj Ayi 
1-1 


By letting N — со we obtain a double integral: 


A= [[a4- [| dx dy 


The problem now is how to regard the boundaries of the area. 
Let us consider a small strip (Fig. 13.5) of width dx corresponding to a summa- 
tion in the y direction. 
The limits of this integral with respect to y are 
lower limit , y=0 


upper limit , y= f(x) 


In this case the upper limit is a function of x. We now insert this into the formula 


and obtain 
f(x) 
A= n dx dy 
y=0 
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Fig. 13.5 


The limits of the variable x are constant: 


lower limit , х-а 


upper limit , x=b 


Inserting these limits into our double integral gives 


b (f(x) 
A= | 1 4х 4у 
х=а Ју=0 


Here the order of integration is no longer arbitrary. We must integrate first with 
respect to variables whose limits are variable. We integrate first with respect to y 
obtaining 


b b 
Аь | LY (x) -ojax = f у(х) dx 


We are already familiar with this result. We see that if we solve the area problem 
systematically we first obtain a double integral. In Chaps. 6 and 7, when dealing with 
areas under curves, one integration had already been performed, without mentioning 
it, by considering a strip of height / (х) and width dx. 


Example Calculate the area A bounded by the curves shown in Fig. 13.6. 
The area A has the following boundaries: 


y=x? for the lower one, 


у= 2х for the upper опе. 


If we integrate in the direction of y from y = x? to y = 2x, we obtain the area of 
the strip of width dx. The required area A is then obtained by integrating along x 
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Fig. 13.6 


from x = 0 to x = 2, since both curves intersect at x = 0 and x = 2. Hence 


2 2x 
A= | | 4у 4х 
x=0 Ју=х2 


Now we have first to integrate with respect to y, since its limits are variable: 


2 13]? 4 
4-1 (2х-х2)4х- а-та) == 
0 3 
We can generalise the process as follows. 

Multiple integrals with variable limits are evaluated step by step, evaluating the 
integrals with variable limits first, up to the last integral, whose limits must be con- 
stant. 

Thus at least one integral must have constant limits. 

You can proceed as follows. 

Find a variable which does not appear in any limit of the integrals. Now integrate 
with respect to this variable. Repeat this procedure until all integrals are dealt with. 
Following this procedure, integrals with variable limits are dealt with first. 

Multiple integrals often occur in practical problems. 


Example Obtain the position (X, y) of the center of mass for the area A in the 
previous example (Fig. 13.6). A problem of this type in mechanics has already been 
discussed in Chap. 7. 

First let us find the component y of the position of the center of mass. To do this 
we take the first moment about the x-axis of an elemental area dx dy and then sum 
for the whole area. Hence 


2 2x 
лу = | T „Y dy dx (A = area) 
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Let us evaluate this integral. Since the variable y does not appear in any limit, we 
integrate with respect to y first: 


2 272% 2 4 512 
= M 2 X 2 3 X 
Ay = — dx = 2x^—— | dx = | -x7 — — | =2.133 
7 | | 2 p й А ( й 2 ) й E io], 


Second, we find the component X in a similar way. In this case, consider the first 
moment of the elemental areas dx dy about the y-axis. 


2 2x 
AX = 1 if x dy dx 
x=0 Ју=х2 


2 2 
and AX = ў, [ху] 25 dx =} (2x? — x3) dx = 
Since A = 4/3, it follows that x = 1, y = 1.6. 


Example Determine the area of a circle with radius R. 
The area is given by a multiple integral, 


A= [[axay 


The problem is how to take the boundaries of the circle into account. Let us look at 
Fig. 13.7. We wish to sum the elemental areas dx dy in the y-direction, indicated 


| 
E: 
TTT NY 


Fig. 13.7 


by the small strip. This means that we integrate with respect to y first. The limits of 
this integration are given by the boundaries of this small strip and these in turn are 
given by the familiar equation of a circle 


х2 +y? =R? ог y=+yR2?-x2 
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Lower limit, yo = —V R? — х2 
Upper limit, y; = +v А2 — x? 


Thus we obtain 
ур-у х2 
А- I dy dx 
yo=y R?-x? 4 
А- fi Di dx 
A- Еа ах 
The expression 2v R? — x? dx is the area of the small strip. 


The remaining integral with respect to x sums together the elemental strips from 
—R to +R. Thus the limits are 


lower limit, xg ——R 


upper limit, x; = +R 


+R 
A= 2V R? — x? dx 
xo—-R 
Using the table of integrals in the appendix to Chap. 6, we obtain 
R 
R2 
А-2 Ё К? — х2 + sin“! (5) p 


А = В2л 


We will see in ће next section that this result сап be obtained much more easily by 
using polar coordinates. 


13.4 Coordinate Systems 


The evaluation of volumes, masses, moments of inertia, load distributions and many 
other physical quantities leads to multiple integrals. The integrals are not always of 
a simple type with constant limits of integration. However, in many cases we can 
obtain simpler types if we replace the variables x,y,z by other more appropriate 
ones. This implies that we should select our coordinate system carefully accord- 
ing to the particular symmetry of the problem. For circular symmetries we choose 
polar coordinates or cylindrical ones. For radial symmetries spherical coordinates 
are advisable. In the following discussion we examine polar coordinates, cylin- 
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drical coordinates and spherical coordinates and relate them to Cartesian coordi- 
nates. 


13.4.1 Polar Coordinates 


Polar coordinates have been first mentioned in Sect. 7.1.2. A point P in the x—y 
plane can be represented by the position vector r, as shown in Fig. 13.8. In Cartesian 
coordinates, the position vector is given by its x — y components. The same position 
vector can be defined by two other quantities: 


the length of r, 
the angle ¢ with respect to the x-axis, or any other fixed direction. 


These two quantities are called polar coordinates. 


Fig. 13.8 


Polar coordinates can be obtained from Cartesian coordinates and vice versa. The 
equations of transfer can be derived from Fig. 13.8. We obtain Cartesian coordinates 
from polar coordinates by 


x-—rcosó 
y=rsing 

We obtain polar coordinates from Cartesian coordinates by 
r=.1/x2 + y2 


tang = 4 
х 
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An elemental area in Cartesian coordinates is given by dA = dx dy (see Fig. 13.9a). 


Fig. 13.9 


In polar coordinates, an elemental area is given by 
dA=rdd¢dr (see Fig. 13.95) 


Example Compute the area of a circle of radius R. 

We have to sum the elemental areas within the boundaries of the circle. The 
variable ф extends from ф = 0 to ф = 2л. The variable r extends from r = 0 to 
r — R. Thus the limits of integration for both variables are constant. 


R 2л 
A= [а= | 1 r dọ dr = xR? 
r=0 Jġ=0 


Note that the area of a circle with polar coordinates is obtained far more easily than 
with Cartesian coordinates. 


Example Compute the area within the spiral r = аф,а > 0 for one rotation of the 
radius vector (see Fig. 13.10). 


Yh 


Fig. 13.10 
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Consider the elemental area dA = r d$ dr. The total area A is given by the inte- 


gral 
A= || аа 


The variable ф extends from ¢ = 0 to ф = 2л, since we are considering one rotation. 
The variable r extends from r = 0 to r = аф. Thus the limits of r are variable. 
Inserting the limits and solving the integral gives 


2л гаф 2л r2 ag 
4-1 1 rar dg = | Е аф 
ф=0 Jr=0 0 2 10 
2л 


2л д2 а? 4 
2 3 2-3 
== — dé = | — — c 
| зан ia ie 


13.4.2 Cylindrical Coordinates 


Cylindrical coordinates are polar coordinates for a point in three-dimensional space 
obtained by the addition of the coordinate z to specify its height, as shown in 
Fig. 13.11. 


Fig. 13.11 


The equations of transformation between cylindrical and Cartesian coordinates 
are 


x = rocoso 
y =rosinġ 
Zu 


or, in the reverse direction, 
ro= Vx? + y? 


tang = 


"ox 


Z 
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The elemental volume dV in Fig. 13.12 is then given by 
dV = го аф dr dz 


Cylindrical coordinates can facilitate calculation in the case of either of the follow- 
ing symmetries. 


Fig. 13.12 


Axial Symmetry 


For cylindrical coordinates we only need the functional relationship between ro 
апа =, since these are independent of the angle ф. 

Examples are the chess piece in Fig. 13.13, and the magnetic field round a coil 
(Fig. 13.14). 


Fig. 13.13 Fig. 13.14 
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Cylindrical Symmetry 


In cylindrical coordinates the function which describes the quantity under consid- 
eration depends only on the distance ro from the z-axis: it is independent of both z 
and ф. 

An example is the magnetic field H = H (r) surrounding a straight conductor 
carrying an electric current (Fig. 13.15). Its absolute value possesses cylindrical 
symmetry. 


Fig. 13.15 


13.4.3 Spherical Coordinates 


Spherical coordinates are particularly useful in problems where radial symmetry 
exists. Furthermore, these coordinates are used in geography to fix a point on the 
Earth's surface; it is assumed that the surface of the Earth is spherical. Spherical 
coordinates are also called spatial polar coordinates. 

To fix the position of a point in these coordinates, we need three quantities: 


r, the position of the radius vector, 

0, the angle between the radius vector and the z-axis, known as the polar angle, 

Ф, the angle which the projection of the radius vector in the x— y plane makes 
with the x-axis, known as the meridian. 


To determine the equations of transformation between Cartesian and spherical co- 
ordinates, we start with the projection of the position vector r upon the x— y plane. 
The projection of the position vector upon the x— y plane has the length r sin 0. 
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The relationships are then easily shown (see Fig. 13.16) to be 


х = г5іпӨсоѕф 
y=rsindsing 


z—rcosÓ0 


Fig. 13.16 


The equations in the reverse direction are 


r=Vx2+y242? 


2 

cos Ü = ——————— 
XS y? pog? 
y 

t = — 

апф x 


The elemental volume is given by 
dV = r?sin0 40 dọ dr 


Itis a little more difficult to determine. Let us find it by taking one step at a time. 
dV in the direction of the radius vector has a thickness dr and a base area dA’ 
(Fig. 13.17), so that 

dV — dAdr 
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Fig. 13.17 


From Fig. 13.18, we see that 
dA = (rsin@ dd) (r d0) = r?sin0 40 d$ 


Hence it follows that 
dV =r?sin6 40 d$ dr 


rsin6d@ 


rd@=rsin аф 


Fig. 13.18 
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Example Obtain the volume of a sphere of radius R. 


In order to obtain V we integrate over the three spherical coordinates. The vol- 


ume V of the sphere is 
л үл R 
| | г2ѕіп Ө dr 40 dọ 
=0 J0—0 Jr=0 


2л үл R? 
54) A — sin 0 d0 d$ 
o Jo 3 


2л R 27 2 R? 
-['5 --с08010 dọ = | T db => 


Spherical Symmetry 


Examples of spherical symmetry include the gravitational field of the Earth, the 


electric field of a point charge and the sound intensity of a point source. 


In spherical coordinates, the absolute value of the describing function depends 


only on the distance r from the origin and not on the angles 0 and ф. 


= (0) 


Table 13.1 shows the important characteristics of cylindrical and spherical coordi- 


nates and their relationship with Cartesian coordinates. 


Table 13.1 
Coordinates Equations of transformation Elemental volume 
Cartesian х dV =dx dy dz 
y 
2 
Cylindrical x = r соѕф r=Vx2+y2 dV =r аф dr dz 
y=rsing tang = » 
x 
2502 2-2 
Spherical x=rsinOcos@ r=.V/x2+y2+4+22 dV = r? sin 40 d$ dr 


y=rsinOsin@ cos = 


z=rcos0 tang = 


Suitable for 


axial 
symmetry; 
cylindrical 
symmetry 
spherical 
symmetry 
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13.5 Application: Moments of Inertia of a Solid 


In Chap. 7, Sect. 7.4.4 we dealt to some extent with moments of inertia. Here we will 
show the calculation of moments of inertia using the concept of multiple integrals. 
The moment of inertia plays a major role in the dynamics of rotary motions. 

The energy of rotation of a mass element dm rotating about the axis of rotation 
with the constant angular velocity w (see Fig. 13.19) is 


1 
dE; = 5 r^ dm 


r denotes the perpendicular distance from the axis of rotation. Thus the energy of 
rotation of the total body is 


1 
Et = 5% | r? dm 
2 body 


The following quantity is called the moment of inertia: 


d r? dm 
body 


Axis of rotation 


Fig. 13.19 


Example Calculate the moment of inertia of the cylinder shown in Fig. 13.20. The 
axis of rotation is the axis of the cylinder. The density is constant throughout the 
body. This problem can best be solved using cylindrical coordinates. 

Consider a mass element dm inside the body at a distance r from the axis of 
rotation. The moment of inertia of the mass element is 


dI =r? dm 
The mass element can be expressed in terms of its volume and density so that 


dm = pdV 
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Fig. 13.20 


The moment of inertia of the whole cylinder is 


га рау =p | av 
V 


The elemental volume dV in cylindrical coordinates is 


dV — r d$ dr dz 
Thus 
h R 2л 
I =| j ! гЗаф dr dz 
z=0/r=0/¢=0 
Rh R? 


—M (М = total mass = oR? zh) 


Example Now we will consider an example in Cartesian coordinates. The solid 
shown in Fig. 13.21 has a square base OABC, vertical faces and a sloping top 
O'A'B'C'. We want to calculate the moment of inertia about the z-axis. 

OA = 25mm, АА! = 50mm, CC’ = 75mm, ОО” = 100mm. The density is uni- 
form throughout the body (p = 7800kg/m?). 

We must first find the equation of the plane O'A'B'C' to get the upper limit of the 
variable z. 

The general equation of a plane is given by ax 4- by +cz +d = 0. The constants 
a,b,c and d are determined by inserting the given values of the four points O' A'B'C' 
in the general equation. For example, 


O'—(0,0,100), sothat cx100+d=0 and d —-—100c; 
A’ = (25,0,50), sothat ax25+cx50+d=0 and d= –25а – 50с. 


There are four such linear equations giving a = 25,b = 1,c = 1,d = —100. The 
required equation is 25x + y + z — 100 = 0. 
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x 


Fig. 13.21 


= |} fol x+y? ) dx dy dz 
100—25x-y 
-Г 42 Эл p(x? +y?) dz dy dx = 13.875 x 106p 
x=0 Jy=0 Jz= 


= 1.079 х 10 ^kg/m? 


To conclude this example, let us also calculate the radius of gyration (see Chap. 7, 
Sect. 7.4.3): 


cp p dx dy dz 


25 100—25x-y 
=pf aie a dz dydz, since p is constant 
Z= 


= i s (100 — 25x — y) dy dx — 39062.5p — 0.305kg 
0 0 
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The radius of gyration with respect to the z-axis is 


1 1.079 x 10-4 -2 
БИ ТЭ 0305 = 1.88 х 10 ^m or 


18.8 mm 


Appendix 


Applications of Double Integrals 


Field of Expression 
application General Cartesian coordinates Polar coordinates 
у= f(x) r=8(¢) 
X2 [Y2 Фә (72 
Area A A | аа er | 4у 4х 21 Í r dr 4ф 
JA Ух: JYI 4$) Sry 
: X2 [Y2 $2 (T2 5, 
First moment М; = / ydA = | | y dy dx = Í r^ sing dr аф 
or static moment sA SNP or» Jy. er 
хэ (Уә $2 (72 > 
My = | xdA al | x dy dx = [ r^ cos $ dr do 
JA Hp JYI Jb, Jri 
: » Mx 
Centroid = 
entroi x A 
. My 
d 1 
2 *2 [У2 5 $92 [T2 302 
Moment Ix = | y-dA x y*dy dx e Í r?° sin^ $ dr аф 
of inertia P Er onda LOL uf 
Хэ [Y2 $5 [T2 
Iy- | x?dA = | | x?dy dx -| i: r? cos? $ dr 4Ф 
JA JX, JV J) Jri 
2 Фэ [T2 
Polar moment Io = n dA 213 L4 (x? + y? )dydx = | Ї r^ dr d$ 
xy AAT 
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Exercises 


13.2 Multiple Integrals with Constant Limits 


1. Evaluate the following multiple integrals: 


(a) LU. nod ш т ei 
(c) [f oss dx dy (d) [f ooa dv 


1/2 1 2 1 У1 21 
(е) | 4х 4у 42 (f) у | ip e^? dx dy dz 
х--1/2Уу--1/2-0 x=0 Ју=уо 2-2 


13.3 Multiple Integrals with Variable Limits 


2. Evaluate the integrals 


2 3x 
(a) | у x?dx dy 
х=0 Ју=х-1 


1 2х х+у 
(b) | | | 4х 4у 42 
x=0/y=0/z=0 


Pay particular attention to the order of integration! 
(c) Using a double integral, obtain the area of an ellipse and the position of 
the center of mass of the half ellipse (x > 0). 
The equation of an ellipse is ^ + p= 1 
13.4 Coordinate Systems 


3. (a) A point has Cartesian coordinates P = (3,3). What are its polar coordi- 
nates? 
(b) Give the equation of a circle of radius R in Cartesian coordinates and 
polar coordinates. 


Fig. 13.22 
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(c) Obtain the equation of the spiral shown in Fig. 13.22 in polar coordi- 
nates. 


л/4 га 
(d) Evaluate J 1 г2соѕ0 dr dé. 
0-0 Jr=0 


4. (a) Compute the volume of the hollow cylinder shown in Fig. 13.23 using 
cylindrical coordinates. 


4 


Fig. 13.23 


(b) Evaluate the volume of а cone of radius А and height л. Obtain the 
moment of inertia of the cone about its center axis. The density p is 
constant. 


5. Calculate the moment of inertia of a sphere of radius R and of constant 
density р about an axis through its center, using spherical coordinates. 


Chapter 14 
Transformation of Coordinates; Matrices 


14.1 Introduction 


One important aspect in the solution of physical and engineering problems is the 
choice of coordinate systems. The right choice may considerably reduce the degree 
of difficulty and the length of the necessary computations. 


Consider, for example, the motion of 
a spherical particle down an inclined 
plane, as shown in Fig. 14.1. 


Fig. 14.1 


The force of gravity, F = mg, directed vertically down can be resolved into two 
components, one parallel to the inclined plane and the other perpendicular to the 
plane, as shown in the figure. 

The component parallel to the inclined plane is 

Е р = те ѕіпа 


and ће component perpendicular to the plane is mg cosa. 


Fig. 14.2 
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To analyse the motion of the particle we must define a frame of reference, i.e. 
a suitable coordinate system. Two obvious possibilities exist: 


(a) the x-axis horizontal (Fig. 14.2a); or 
(b) the x-axis parallel to the inclined plane (Fig. 14.2b). 


Of course the actual motion of the particle is totally independent of the choice of 
a coordinate system. But it is important to note that a judicious choice of coordinates 
can simplify the calculations, as we will now demonstrate. 

First let us consider case (a). As the particle rolls down the incline, it has motion 
in both the x and y directions. In order to determine the motion, we need to divide 
the force F p into its components in the x-direction and the y-direction (Fig. 14.3a): 


F рх = Е рсоѕа = mg sing cosg 
Е ру = Fpsina = —mg sina sina 
According to Newton’s second law of motion, 


mx = mg sina cosa 


my = —mg sin? a 


The negative sign takes care of the fact that the direction of the force and the chosen 
positive direction of the y-coordinate are opposite to each other. 


Fig. 14.3 


Now let us consider case (b). The motion is restricted to the x-direction 
(Fig. 14.3b). Hence 


mx = mg sina 


my = 0 


These equations аге obviously much simpler than those for case (a). 

This example shows the importance of the choice of coordinates; in fact in some 
cases a problem can only be solved through the choice of appropriate coordinates. 
Therefore, before commencing the solution of a problem we should spend a little 
time selecting the most appropriate system of coordinates. 
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In very complex problems it sometimes happens that during a calculation it be- 
comes apparent that a different choice of coordinates would have been more sen- 
sible. In such cases, we can either start again from scratch, or, if a lot of work has 
already been done, transform the old coordinates into new ones. 

In this chapter we consider the second alternative, i.e. the transformation of 
a rectangular x— y —z coordinate system into a different rectangular x' — y'—z' co- 
ordinate system, as illustrated in Fig. 14.4. 


Fig. 14.4 


The following two transformations are particularly important. 


Translation 


The origin of the coordinates is shifted by a vector ro in such a way that the old and 
the new axes are parallel (Fig. 14.52). 
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Rotation 


The new system of coordinates is rotated by an angle ¢ relative to the old system. 
(Figure 14.5b shows, as an example, a rotation about the x-axis through an angle ф.) 


A (b) 


x and x’ 


Fig. 14.5b 


A more general transformation of a rectangular coordinate system into a different 
rectangular coordinate system is composed of a translation and a rotation. In this 
book we will not consider inversion. For this topic and for general transformations 
of coordinate systems the reader should consult advanced mathematical texts. 


14.2 Parallel Shift of Coordinates: Translation 


Figure 14.6 shows a point P whose position is defined by the vector r = (x,y,z) in 
an x— y —z coordinate system. 


Fig. 14.6 
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We now shift the origin O of the coordinate system to a new origin O' by the 
vector r o = (хо, Yo, zo), as shown in Fig. 14.7, and denote the new set of coordinates 
by the axes x’, y’ and 27. 

What are the coordinates of P in the new system of coordinates? The vector r in 
the x— y—z system corresponds to the vector r’ in the x'— y/—z' system of coordi- 
nates. From Fig. 14.7 we see that 

r=rotr’ 


or r'=r—ro 


This is the required vectorial transformation when the axes remain parallel. 


Fig. 14.7 


When expressed in terms of the coordinates, we obtain the transformation rule 
for a shift. 


Transformation rule Ifan x—y—z coordinate system is shifted by a vec- 
tor ro = (хо, yo, zo), the coordinates of a point in 
the shifted x' — y'—z' system are given by 


х= х= х0 х= х' + х0 
у=у-уо or у= у +уо 
z!—z-z z=z'+29 (14.1) 


Example Consider a position vector r = (5,2,3) of a point P. Now shift the coor- 
dinate system by the vector r o = (2, —3,7). Calculate the position vector in the new 
system. 
According to the transformation rule we have r' = r — ro 
Xce288 
y 22-(-3)55 
2-3-7--4 
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Hence P in the x/— y'—z' system is given by the position vector r’ = (3,5, —4). 

To make clear how useful it can be to shift coordinates, let us consider an- 
other case. 

Figure 14.8a shows a sphere of radius R whose center O' does not coincide with 
the origin of an x— y—z coordinate system. 


(a) (b) 
Fig. 14.8 


Let us investigate the equation of the sphere in two sets of coordinates. The center 
of the sphere is fixed by the position vector ro = (хо, yo, zo). The position vector 
for an arbitrary point P on the sphere (Fig. 14.8b) is 


r=ro+R or R=r-ro 


Taking the scalar product, we get the equation of the sphere: 


Е.К = К? = (х – хо)? + (у — yo)? + (2 – 20)? 


Fig. 14.9 
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Now we will consider an x’— у —z' coordinate system which is obtained by shifting 
the old system by the vector ro. 

The new origin of the coordinate system coincides with the center of the sphere 
(see Fig. 14.9). The equation of the sphere in the x/— y'—z' coordinate system is 
well known to be 

К2 = х2 фу? 422 


This equation is obtained by applying the transformation rule to the previous equa- 
tion. 

Hence the equation of a sphere and other equations as well can often be made 
simpler by shifting the origin of the coordinate system. 


14.3 Rotation 


14.3.1 Rotation in a Plane 


Consider the position vector r = xi + yj in an x—y system of coordinates. We 
can now rotate this system through an angle $ into a new position, as shown in 
Fig. 14.10. The new coordinate axes are denoted by x’ and y’ and the unit vectors 
by i’ and j^, respectively. 


Fig. 14.10 


In the x'— y' coordinate system the vector r is given by 
r= xli А + Ур 
The problem now is to find the relationship between the original coordinates (х, y) 
and the new coordinates (x’, y^). 
We start with the components (x, y) of r in the original system. These are sepa- 


rated into components in the direction of the new axes; we need to find these com- 
ponents. Finally, we will collect corresponding terms. 
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From Fig. 14.11, we have 
xi =xcosgi'—xsingj’ for the x component 
yj = уѕіпфі' + усоѕфј' for ће у component 
In the original system the vector r was given by 
r=xi+yj 


In the new system, the vector r is obtained by using the relationships for xi and 
yj. Thus 


r = xcos¢i’—xsindj'+ysingi’+ycos¢j’ 
or г =(xcos¢+ysing)i’+(—xsing + ycos¢) j’ 


The expressions in brackets are the components x’ and y’ in the new coordinate 
system: 


x = хсоѕф + ysing 
y = —xsin$ + ycosó 


=x sin ġ j’ * —xsin pj’ 


Fig. 14.11 
By the reverse argument, the vector r = (x,y) is obtained from r = (x’,y’) by 
replacing ф by —ф. We get 


x = х'соѕф — y'sinó 
y = х'ѕіпф + y'cosó 
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Rule If a vector r = (x,y) is transformed into the vector r = (x’, y^) 
when a two-dimensional system of coordinates is rotated through 
an angle ф, the transformation equations are 


х! = xcos@+ysing 
у! = —xsin$ + ycosó 
x = х'соѕф — у'ѕіпф 


y = х'ѕіпф + у'соѕф (14.2) 


Example Given the position vector r = (2,2) of a point P in ап x—y system, what 
is the position vector of this point when the coordinate system is rotated through an 
angle of 45°? 

r is transformed by the rotation according to (14.2) as follows: 


x'i' = (2cos45? + 2sin45°)i! = 2/2i! 
y' j' = (-2sin45? + 2c0s 45°) j =0x j’ 
Hence r is given in the rotated system by r = (22,0) as shown in Fig. 14.12. 


It is obvious that in the new system the y’-component vanishes since the x'-axis 
coincides with r. 


Fig. 14.12 


Example Given the hyperbola x?— y? = 1 in ап x— y coordinate system, what is 
the equation of this same hyperbola in an x’—y’ coordinate system after the coordi- 
nate system has been rotated through an angle of —45?? 

From (14.2), we have 


Qus 
x = x'cos(—45?) — y'sin( 45°) = 2 V2(x + y") 


$m 
y = x'sin(—45?) + y’cos(—45°) = 5; 20" x) 
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Substituting these into the equation x?— y? = 1 gives 


БИЕ = | У207-3)| PS. 


1 1 
or 30 + 2x'y' + y?) - 307 2x y' ex?) = 1 


1 
H РЭН: ТЭЭ 
епсе УЕ: OP сул ыг) 
This is the required equation of the hyperbola in the x'— y' coordinate system. 
Let us consider the reverse problem. 
Given xy = 1/2, we rotate the system of coordinates through an angle of +45°. 
Then the equation of the hyperbola in the new x’—y’ system of coordinates is 
1 
(x' cos 45° — y' cos45?) (x'sin45? + y'cos45?) = 3 
Непсе x'2—y?2] 


14.3.2 Successive Rotations 


We will now derive transformation equations for the case when an x— y coordinate 
system is rotated through an angle $ into an x'— y’ system and then taken through 
a further rotation y into an x"— y" system. We require the formula describing the 
transformation from the x— y system into the x"— y" system. 


æ 
> 
» 


рана Аро RP ate E 


Fig. 14.13 


We note from Fig. 14.13 that the two successive rotations, ф and ү, are equal to 
a single rotation, ф + ү. We will show analytically that this assumption is justified. 
From (14.2) we have 


(a) the rotation of ће x— y system into the x'— y’ system through an angle $ gives 
y Sy y Sy 8 8 8 


xcosd+ ysing 


xm 
у! = —xsinó + усоѕф 
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(b) the rotation of the x'—y' system into the x" — y" system through an an- 
gle v gives 
x" = x' coss + y'sin y 
y" =—x'sinw+y' cosy 
Substituting the expressions for x’ and y’ in the last two equations gives 
x" = (хсоѕф + уѕіпф) cosy +(—xsing + усоѕф) ѕіп y 
n 


y” = —(xcosó + уѕіпф)ѕіпу + (—xsing + y cos) cosy 


Expanding, rearranging and using the addition theorems of trigonometry 
(Chap. 3, (3.10)) gives the rule for successive rotations. 


Transformation rule for successive rotations in the x— y plane: 


x” = хсоѕ(ф+ V) + ysin($ + V) 
y" = —xsin( + Y) + ycos(9 + V) 
Reverse transformation: (14.3) 
x = x" cos(9 + y) — y"sin($ + V) 
y = x"'sin($ + у) +y” cosl +y) 


The vector r = (x, y) is transformed into the vector r = (x", y") if the coordinate 
system is rotated successively through the angles ф and y. 
Thus our assumption was correct and the rule is established. 


14.3.3 Rotations in Three-Dimensional Space 


In this section, we will restrict ourselves to rotations about one of the coordi- 
nate axes. 
Case I: Rotation about the z-axis through an angle $ (Fig. 14.14). 


Fig. 14.14 
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The x-axis is rotated to the x'-axis and the y-axis to the y'-axis, the z-axis re- 
maining unchanged, i.e. the z- and z'-axes coincide. 

It follows that the z-component of a vector r = (x, y, z) is unchanged for a rota- 
tion $ about the z-axis, i.e. Z’ = Z. 

The transformations of the x- and y-components are those of a rotation through 
an angle ф in a plane. The transformations from the x— y plane to ће x'— y’ plane 
are thus the same as in (14.2) in Sect. 14.3.1. 

х! = xcos¢+ ysing 
= —xsin$ + y cosó 
А 


2 = 2 


Case 2: Now consider a rotation about the x-axis, as shown in Fig. 14.15. 


Fig. 14.15 


In this case we see that y > y’,z — z' and x’ = x; the transformation takes place 
in the y—z plane. 
From equations (14.2) we find 


y! = усоѕф +25іпф 
2! = —уѕіпф +zcosd 
XE 


Similarly, for rotations about the y-axis the transformation would take place in the 
x—z plane and у! = y. 
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Rule For a rotation about the z-axis of a three-dimensional x— y —z sys- 


tem through an angle ф, the transformation equations are 


/ 


X =xcosd+ysing 


/ 


y = —xsinġ + усоѕф 


z = 2 


For а rotation about the x-axis, they are 


А 
=x 
А 


y = усо5ф +2510 ф 
2 = —ysin$ --zcos$ 


For a rotation about the y-axis, they are 
x = xcosQ + zsinó 


Yo ey. 
z =—xsingd+zcosd 


(14.4) 


(14.5) 


(14.6) 


We have obtained transformation equations for rotations about the x-, y- or z- 
axis only. Successive rotations can be described as a single rotation about some axis. 
Conversely, any rotation about any given axis can be described as a succession of 
rotations about the axes of the coordinate system. Details of this can be found in 


more advanced texts on algebra. 


14.4 Matrix Algebra 


Matrix algebra is a powerful tool in linear algebra. It also has the advantage of being 
very concise. The transformation equations in the preceding sections can be more 


clearly arranged by introducing the concept of matrix operations. 


Definition A rectangular array or set of real numbers is called a real matrix. 


411 412 `: Gin 
421 422 `` ап 


Amı m2 --- атп 
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The horizontal lines of numbers are referred to as rows of the matrix. 


This example shows the second row. 


The vertical lines of numbers are referred to as columns of the matrix. 


This example shows the second column. 


Please note that in this book we are dealing only with real matrices (as opposed 
to complex matrices). Also, all vectors considered are real vectors (as opposed 
to complex vectors). 

A matrix with m rows and n columns is said to be an m x n matrix or an 
(m,n) matrix. When m Æ n the matrix is rectangular, and when m = n it is 
square. Ап m х n matrix is said to be of the order m x n. 

Matrices are often denoted by a bold upper-case letter or by an upper-case 
letter underlined, e.g. A in a manuscript. 


Example 
411 412 
А = | a21 a22 | isa3x2 matrix, or a matrix of order 3 x 2 = 6 
азі 432 


This matrix can also be written thus: 


А = (а) with i= 1,2,3 
k=1,2 


The numbers aj; are called the elements of the matrix; the first subscript, i, refers 
to the row and the second, k, to the column. 

In the case of square matrices, the elements a;; are found on a diagonal which is 
called the leading diagonal. 
Matrices with one row or one column only are referred to as vectors. 

Column and row vectors are denoted by bold lower-case letters. 

For example a row vector is given by 


а = (а) (п) = (411 ü12 22 din) = (a1 a2 P ан) 
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For example a column vector is given by 


ац ay 

d21 аз 
а-(аж)па)-1 . |= 

ап an 


14.4.1 Addition and Subtraction of Matrices 


Definition The sum (or difference) of two matrices A and B of the same 
order m x n is another matrix C of the order m x n whose el- 
ements сү, are the sum (or difference) ар, + big of the corre- 
sponding elements of matrices A and B. 


11 = bi 412 = 512 Uc d]g c big 
а21 = b21 422 = рээ UU An E bon 
С=А+В = 1 
ат Ebm coe атп + bmn 
(14.7) 
Example Given 
3-1 4 10 -2 5-8 0 
А = 13 3-2 and B—|[-4 1-3 0 
-7 1 5 3 1-3 0-1 


obtain A+ B. 
We simply add the elements with the same subscript, i.e. in the same posi- 
tion. Hence 


1 4-4 10 
C=A+B=|-3 4 0 -2 
-6-2 5 2 
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14.4.2 Multiplication of a Matrix by a Scalar 


Definition А matrix A multiplied by a scalar quantity k is a new matrix 


whose elements are multiplied by К. (14.8) 
Example 
if qe o a12 2 
421 422 423 
2 kay Каџә Каз 
then kA = te M 8 
Example 
5 equ 
If 4-( 2 2) and к= 2.5 
12.5 —17.5 —2.5 
then КА = ( 15 5 5 ) 


14.4.3 Product of a Matrix and a Vector 


We illustrate the product of a matrix and a vector by considering the 2 x 2 matrix 
ayia 
A = [91 %2 
421 422 
x 
and the vector r=(x,y) or r= ( ) 


We can therefore state the following definition. 


Definition The product Ar of a matrix A and a vector r is a new vector r' 
whose components are given by 


r= Ap a (21 412) (x) _ (ax tary (14.9) 
421 422) VY 421X +а22у 


The components x’ and y’ of the vector r’ are obtained by forming the product 
/ 


з x 
of rows and columns as for the scalar product. Hence, if r' = y) then 


x'—ajx-ajy апі y'-aoix-aa2y 
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Example Obtain Ar if A = ( 2 andr — C ) 


6 y 
/ 
ou (Хү (1-31(хү f( x 3у 
шанг G) H ( ) C) H p 
x 
If we have a 3 x 3 matrix and a three-dimensional vector | y | then 
Z 
х! 411 012 413 x 411X 1 d12Yy T4137 
У | = | 214422 423 У | = | 421Х--422У +а232 
2 азі 432 433) \z d31X +аз2у +4332 


Example Obtain r’ = Ar if r = (x,y,z) and 


1 0 3 
А-14-2 0 
0 0 5 
The solution is 
х! 1 0 3 x x+3z 
г-1У1-14-2 0 уУ|-14х-2у 
2! 0.0 5 2 52 


14.4.4 Multiplication of Two Matrices 


The product A B of two matrices A and B is defined as follows, provided that the 
number of columns in A is the same as the number of rows in B. 


Definition The product A B of a matrix A of order (m x n) and a matrix B 
of order (n x p) is a matrix C of order (m x p). The coefficients 
of the matrix C are denoted by 


cin (i = 1,2,...m; k = 1,2,... p). 
They are obtained by multiplying the ith row of matrix A by the 


kth column of B, both being considered as vectors and forming 
the ‘scalar product’ as follows: 


n 
Cik = Y, divbyk = апбуу diabak +: +4inbng (14.10) 


v=1 
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The following diagram indicates the way the coefficients of the matrix C = AB 
are generated. It shows how the coefficient c22 is calculated. 


2nd 
column 

by, by 

by bn 

b31 632 


a a a (11 € 
2nd row 11 12 13 11 €12 
421 422 423 C2] €22 


This shows what is meant by saying that we form the scalar product of the ith 
row (2nd in this example) of matrix A by the kth column (2nd in this example) of 
matrix B. 


Example Obtain the product of 


The solution is 
5 2\ (-3 7 —15+2 35-2 
AB = (5 i) ( idet 0-1 =) 


Example Find the product of 


A= and В-1-3-1 
0 1-4 A 5 
6 2 1 


A isa 4 x 3 matrix and B a 3 x 2 matrix. Hence the product will be a 4 x 2 matrix. 
The solution is 


2+0+4 04045 6 5 
ap —|4+21+32 0+7+40| | 57 47 
0—3—16 0—1—20 —19 —21 
12-644 0-2--5 10 3 


Note that the product of two matrices A and B is, in general, not commutative, 
ie. AB 5 BA. 
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14.5 Rotations Expressed in Matrix Form 


14.5.1 Rotation in Two-Dimensional Space 


The transformation equations for a rotation through an angle $ (Fig. 14.16) were 
obtained in Sect. 14.3.1 (14.2). They are 


x'— хсоѕф + ysinó 
/ 


y = —xsin$ + усоѕф 


Fig. 14.16 


These equations can now be expressed in matrix form to give the new vector as 
a product of a rotation matrix and the original vector: 


x'\ _ / соѕф sing\ (х 
y]  \—sing cosd) Vy 


Example Let us rotate an x — y coordinate system through an angle ф = л /2 so that 
the x-axis moves into the y-axis and the y-axis into the negative x-axis. Calculate 
the rotation matrix. 

Substituting the value of л /2 into the general rotation matrix we get 


cos 4 sin 5 ( 0 ) 
—sin 5 cos + -10 


The transformation of coordinates is obtained: 


5) =i G) =) 


We now determine the matrix for successive rotations ф and ү. 
We first rotate through an angle $. Thus x goes to x’ and y to y’. Second, we 
rotate through an angle y. Thus x’ goes to x" and y’ to y". The equations obtained 
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in Sect. 14.3.2 look in matrix form as follows: 


x'\ ( совф sing \ (x 1 
Уу!) 7 \=sing cosd) Vy | 
x"\ ( cosy sinw\ [x 

& Е 2 20 b [2] 


x"N (cosy siny со8ф sin$ /x 
У”)  V—siny cosy) V —sin$ совф/ Vy 


Multiplying out these matrices gives 


x"\ [f cosy cos —sinysinó — cosy sing + sin y coso x 
"}  V—siny coso — cosy sin$ — sin y sing +cosy coso) Vy 


By applying the addition theorems in trigonometry, the transformation matrix finally 


becomes 
( cos(p+ V.) sin($-- ») 
—sin(d + y) соѕ(ф +) 


14.5.2 Special Rotation in Three-Dimensional Space 


In Sect. 14.3.3 we derived the transformation equations for a rotation about the z- 
axis (Fig. 14.17) through an angle 9. 
These transformation equations are 


x’ = хсоѕф + ysing 
y! = —xsing+ycosd 


z = 2 


Fig. 14.17 
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We can express these equations in matrix form thus: 


x! соѕф sing 0\ /х 
y' | =| —sing соѕфо | | у 
2! 0 0 1 2 


The transformation matrix for rotation about the y-axis through an angle w is 


cosy 0 siny 
0 1 0 
—siny 0 cos y 


You should verify this for yourself. 


14.6 Special Matrices 


In this section we introduce the definitions of some important special matrices. 
Some assertions will be made without proof. 


Unit Matrix 
A unit matrix is a quadratic matrix of the following form: 


100 
1—|010 
001 


All elements on the leading diagonal are unity and all other elements are zero. 
If a vector r or a matrix A is multiplied by a unit matrix J, the vector or the 
matrix remains unchanged. 


Ir —r 
IA = А 


These relationships are easy to verify. 


Diagonal Matrices 


A diagonal matrix is a quadratic matrix whose elements are all zero except those on 
the leading diagonal. 


11 0 0 
D= 0 a22 0 
0 0 33 


A unit matrix is thus a special diagonal matrix. 
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Null Matrix 

A null matrix is one whose elements are all zero; it is denoted by 0. We should note 
that if A B = 0, it does not necessarily follow that A = 0 or B = 0. 

Transposed Matrix 

If we interchange the rows and columns of a matrix A of order m x n we obtain 


a new matrix of order n x m. This matrix is called the transposed matrix or the 
transpose of the original matrix and it is denoted by AT or A. 


e 411 421 431 
If A—[a21a22|] then АГ- (14.11) 
412 422 032 
4з1 432 


Note that A! is the mirror image of A. 


Example 


20 0 2 
If A—-|[2 1-6| then АГ-10 
6 0-1 


Note that the first row becomes the first column, the second row becomes the second 
column, etc. 
You should verify the following assertions for yourself. 


1. The transpose of the transposed matrix gives the original matrix A. 


(AT  — A 
2. An important relationship is 
(AB)! = ВТАТ 
and generally (ABC... 2) = Z"...BTAT 


Orthogonal Matrices 


A square matrix A which satisfies the following identity is called an orthogonal 
matrix: 


AAT = 1 (orthogonality) (14.12) 


This relationship is equivalent to 


ATA- I 
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It can be interpreted in terms of rows and columns of the matrix A as follows. The 
nth column of A is the nth row of AT. Now consider the equation AA! = J. If 
we think of rows and columns as vectors and compute their scalar product, then we 
observe for an orthogonal matrix A that 


1. the scalar product of a column by itself is 1; 
2. the scalar product of a column by a different column is always zero. 


The following assertions are equivalent: 


3. the scalar product of a row by itself is 1; 
4. the scalar product of a row by a different row is always zero. 


For example, the matrices describing rotations are always orthogonal matrices. 
The name ‘orthogonal’ is derived from the fact that if an orthogonal matrix A is 
applied to two vectors r and s, their scalar product remains unaffected, i.e. 


r:s = (Ағ): (As) 
This implies that the lengths and angles of the vectors are preserved, and, in partic- 


ular, a system of orthogonal coordinate axes is transformed into another orthogonal 
system. 


Singular Matrix 

A matrix whose determinant is zero is called a singular matrix (for determinants see 
Chap. 15). 

Symmetric Matrices and Skew-Symmetric (or Antisymmetric) Matrices 


For square matrices two new properties may be relevant. A square matrix is called 
symmetric if for all i and ja;; = a ji. This means it equals its transpose. 


А = АТ (symmetry) (14.13) 


A square matrix is called skew-symmetric or antisymmetric if all aj; = —a ji. 
This means it equals the negative of its transpose. Note that for antisymmetric ma- 
trices all elements on the leading diagonal are zero. 


А--АТ (skew-symmetry) (14.14) 


It is useful to note that any square matrix can be expressed as the sum of a sym- 
metric and a skew-symmetric matrix. 
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Proof 
A= 3(A-- A7) 4-5 (A - A7) 


Observe that the first term is a symmetric matrix and the second term is a skew- 
symmetric matrix. 


Example Write as the sum of a skew-symmetric and a symmetric matrix 


798 29 26 
A= 1 827 
74 69 88 
The solution is 
0 14 —24 798 15 50 
А-1-14 0-211- 15 848 
24 21 0 50 48 88 


14.7 Inverse Matrix 


If a square matrix A when multiplied by another matrix B results in the unit matrix, 
then the matrix B is called the inverse matrix of A; it is denoted by A~!. Not all 
matrices possess an inverse, and the criterion for a matrix to have an inverse is that 
its determinant must be different from zero, i.e. it must not be singular. Determinants 
are dealt with in Chap. 15. If an inverse exists it is unique. 

The following equations must hold true: 


AA"! —I, (post multiplication by A^!) 
ATA =I, (pre multiplication by А!) (14.15) 
We will not give details here of how А! is calculated. This will be done in 


Chap. 15, Sect. 15.2.3. For the time being we will give only the following example 
for an inverse matrix. 


200 100 
If A- [21-6] then А47! = | 171-6 
60-1 30 =1 


As an exercise, you should verify for yourself that 
AA = АТА =1 


Returning to the concept of orthogonal matrices introduced earlier, we сап now state 
the following criterion. A square matrix A is orthogonal if its inverse is equal to its 
transpose, i.e. if A7! = AT. 
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If the operations with matrices have been understood using simple examples you 
will do the calculations later on using a PC and programs like Mathematica, Maple, 
Derive or others. 


Exercises 


14.2 Parallel Shift of Coordinates 


1. The vertex of the paraboloid shown in Fig. 14.18 is at a distance 2 from the 
origin of the coordinates. The equation is 


z-24x^4 y? 


What is the transformation which will shift the paraboloid so that its vertex 
coincides with the origin O? 


x 


Fig. 14.18 


2. The equation of a certain straight line is y = —3x + 5. What will its equation 
be in a new x'— y' coordinate system due to a shift of the origin of (—2,3)? 
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14.3 Rotation 


3. A two-dimensional system of coordinates is rotated through an angle of 
л /3. The transformation matrix is 


125523 
ay 55, 
V3 1 
208272 


What is the new vector r' if r = (2,4)? 


4. Given the equation y = —x/ V3 + 2, if the system of coordinates is rotated 
through an angle of 60° obtain an expression for the equation in the rotated 
system. 


5. A three-dimensional system of coordinates is rotated about the z-axis 
through an angle of 30°. Obtain the transformed vector r' if r = (3,3,3). 


14.4 Matrix Algebra 


6. Given the two matrices A and B where 


13 20 
А-|25 and В-1-13 
07 -12 
evaluate 
(a) A+B, 
(D А-В 
7. Let 
2527 
А-13 0 and 2 
9-1 


(a) Evaluate the matrix 6A. 
(b) Show that the expression АВ 52 BA. 


8. Given 
12 -1 0 
А= (73 and B= 2 3 
59 -1-1 


evaluate АВ 
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9. Evaluate the product Ar = r’ if 


03. 7-0 


10. Given 


evaluate 


(a) AT 
(b) (AT)T 


11. How many independent entries are there in a skew-symmetric 3 x 3 matrix? 


12. Decompose into a symmetric and a skew-symmetric matrix: 


54 1 1 
0 26 20 
8 84 9 
14.7 Inverse Matrix 
13. If 
1 0 3 1 —8 
А-|2-3 1| and A im -3-1 5 
1 2:2 7 E 


show that 44A = АА = 


Chapter 15 
Sets of Linear Equations; Determinants 


15.1 Introduction 


In this chapter we will investigate the solution of sets of linear algebraic equations. 
First, we show a method which will be used in most practical cases. This is the 
Gaussian method of elimination and its refinements. The basic idea is quite clear 
and elementary. Notation in matrix form will prove to be helpful. 

Second, the concept of determinants and a second method of solution, Cramer’s 
rule, will be developed. This concept is of theoretical importance, e.g. a determinant 
shows whether a set of simultaneous equations is uniquely solvable. 


15.2 Sets of Linear Equations 


15.2.1 Gaussian Elimination: Successive Elimination of Variables 


Our problem is to solve a set of linear algebraic equations. For the time being we 
will assume that a unique solution exists and that the number of equations equals 
the number of variables. 

Consider a set of three equations: 


ахі +412X2 +413x3 = bı 


а21Х1 + d22X2 +a23x3 = b2 


азі хі - d32X2 + аззхз = b3 


The basic idea of the Gaussian elimination method is the transformation of this set 
of equations into a staggered set: 


/ / / / 
a 11X1 +4 12X2 +4 13X3 = bi 


/ / / 
а 22X2 +а эзхз = b» 


/ / 
а 33x3 = b3 


К. Weltner, W.J. Weber, J. Grosjean, P. Schuster, Mathematics for Physicists and Engineers 
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All coefficients a^;; below the diagonal are zero. The solution in this case is straight- 
forward. The last equation is solved for x3. Now, the second can be solved by in- 
serting the value of x3. This procedure can be repeated for the uppermost equation. 

The question is how to transform the given set of equations into a staggered set. 
This can be achieved by the method of successive elimination of variables. The 
following steps are necessary: 


1. We have to eliminate x, in all but the first equation. This can be done by sub- 
tracting a21 /a 1 times the first equation from the second equation and a31 /a11 
times the first equation from the third equation. 

2. We have to eliminate x2 in all but the second equation. This can be done by 
subtracting 432 /a22 times the second equation from the third equation. 

3. Determination of the variables. Starting with the last equation in the set and 
proceeding upwards, we obtain first хз, then x2, and finally хі. 


This procedure is called the Gaussian method of elimination. It can be extended to 
sets of any number of linear equations. 


Example We can solve the following set of equations according to the procedure 
given: 


бху — 12x2 + 6x3 = 6 [1] 
3x1 ож 5Х2 E 5xa = 13 [2] 
2x1 rz 6x5 +0 = —10 [3] 


1. Elimination of x1. We multiply Eq. [1] by 3/6 and subtract it from Eq. [2]. Then 
we multiply Eq. [1] by 2/6 and subtract it from Eq. [3]. The result is 


бху — 12x2 + 6x3 = 6 [1] 
хэ +2x3 = 10 [2] 
-2Хэ2 — 2х3 --12 (37 


бху — 12x24- 6x3 = 6 (17 
x2 +2x3 = 10 [2’] 
2x3 = 8 [3"] 


3. Determination of the variables x1,x2,x3. Starting with the last equation in the 
set, we obtain 


325-4 


Now Eq. [2/] can be solved for хэ by inserting the value of хз. Thus 


х= 2 
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This procedure is repeated for Eq. [1] giving 


Xp 


15.2.2 Gauss-Jordan Elimination 


Let us consider whether a set of n linear equations with n variables can be trans- 
formed by successive elimination of the variables into the form 


m+ 04 04.4 020 
O+x+ 04-04 0 = С 
0+ 0+x3 ++ 0 = Сз 
0+ 0+ 0x = Cn 


The transformed set of equations gives the solution for all variables directly. The 
transformation is achieved by the following method, which is basically an extension 
of the Gaussian elimination method. 

At each step, the elimination of x; has to be carried out not only for the co- 
efficients below the diagonal, but also for the coefficients above the diagonal. In 
addition, the equation is divided by the coefficient a jj. 

This method is called Gauss-Jordan elimination. 

We show the procedure by using the previous example. 

This is the set 


6x1 — 12x2 + 6x3 =6 
3x1 = 5x2 +5x3 = 13 
2Х)- 6x2+0 =-10 


To facilitate the numerical calculation, we will begin each step by dividing the re- 
spective equation by a jj. 


1. We divide the first equation by аџ = 6 and eliminate x, in the other two equa- 


tions. 
Second equation: we subtract 3 x first equation 
Third equation: we subtract 2 x first equation 
This gives 
X1 — 2X2 + X3 21 [1] 
O+x2 +2x3 = 10 [2] 


0 — 2x2 — 2х3 --12 [3] 
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2. We eliminate x2 above and below the diagonal. 


Third equation: we add 2 x second equation 


First equation: we add 2 x second equation 


This gives 
xı +0 +5х3 = 21 [17] 
0+ x2 + 2х3 = 10 [27] 
0+0 +2хз = 8 [37] 
3. We divide the third equation by a33 and eliminate x3 іп the two equations 
above it. 
Second equation: we subtract 2 x third equation 
First equation: we subtract 5 x third equation 
This gives 
X190 +0 =1 [1"] 
O+x2+0 =2 [2"] 
0+0 +x3=4 [3"] 


This results in the final form which shows the solution. 


15.2.3 Matrix Notation of Sets of Equations 
and Determination of the Inverse Matrix 


Let us consider the following set of linear algebraic equations: 


a11X1 + d12X2 - d13X3 = by 
а21Х1 + d22X2 + d23X3 = b2 
431X1 - d32X2 + d33X3 = b3 


This set of equations can formally be written as a matrix equation. Let A be a matrix, 
whose elements are the coefficients a;;. It is called a matrix of coefficients. 


411 d12 d13 
A = | a21 422 423 
431 432 433 


x and b are column vectors: 
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The set of equations can now be written 
Ax=b 


In Chap. 14 we discussed the rules for the multiplication of matrices, the concepts of 
the inverse matrix A ^! and of the unit matrix J. You will remember that A^! A = J. 
Let us consider a matrix equation representing a set of linear algebraic equations: 


Ax=b 


We will now multiply both sides of this matrix equation from the left (premultipli- 
cation) by the inverse of A. 
A lAx-A lb 


Since A^! A = I, we obtain 
Ix 2 A^! b 


This equation is in fact the solution of the set of linear equations in matrix notation. 
But at present we do not have the inverse A^! of the matrix of coefficients A to 
perform this multiplication. On the other hand, we do know a method for solving 
a set of linear equations, e.g. the Gauss-Jordan elimination. We want to find if a re- 
lationship exists between the solution of a set of equations and the determination of 
Act 

Without giving the proof we can state the answer. We transform the matrix of 
coefficients A by the Gauss-Jordan elimination into a unit matrix J. If we apply ай 
operations simultaneously to a unit matrix J, the latter will be transformed into the 
inverse A 1. 

Thus we do not, in practice, gain a new method for solving a set of linear equa- 
tions, but a method for calculating the inverse of a given matrix. 

Consequently, if we form the inverse of the matrix of coefficients A and premul- 
tiply matrix b by it, then we obtain as a column vector the solution of x. 

An n x m matrix can formally be augmented by another n x o matrix B thus 
forming an augmented n x (т + о) matrix denoted А |В. For example, А |1 is an 
augmented matrix whose first part consists of A and whose second part consists 
of I. 


Rule Calculation of the inverse А! of a matrix А. 
Augment A by a unit matrix I. Execute the Gauss-Jordan elimina- 
tion to transform the first part A of the augmented matrix into a unit 
matrix. Then the second part Z will be transformed into A ^! . 


As an example, we will show the calculation of the inverse matrix of A cited in 
Sect. 14.7. Consider 
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We extend A by J and get the augmented matrix A |I: 
0.01 0 0 
AJM-|[2 1-60 1 0 
6 0-10 0 1 


Now we carry out the Gauss—Jordan elimination to transform its first part A into the 
unit matrix, following the steps described in the previous section. 


1. Division of the first row by a11 = 2 and elimination of the elements of the first 
column below the diagonal results in 


Ww = јн 


1 0 0 
0 1-6|— 
0 0-1|— 


0 0 
1 0 
0 1 


2. The elements of the second column above and below the diagonal are already 
Zero, so nothing has to be done for this step. 

3. Division of the third row by a33 = —1 and elimination of the element above in 
the third column results in 


100200 
0 1 0117 1-6 
00 цз 0-I 


The second part of the augmented matrix represents A adi; 


12 0 0 
A'-2[17 1-6 
3 0-1 


Further, we make use of the matrix notation to facilitate the writing while transform- 
ing the system of equations. 

Each row of the matrix equation Ax = b represents a linear algebraic equation. 

Suppose we multiply row i by a factor. Then all terms aj; x;(j = 1 ... n) and b; 
have to be multiplied by this factor. This is carried out by multiplying all elements 
in row i of the matrix of coefficients and b; by this factor. 

Suppose we add row i to row j. Then we have a new row whose coefficients are 
(а ал), (ai2+4j2),---, (din +. jn) and the value of bj is then (bj + bj). 

This equals the addition of corresponding elements of the coefficient matrix A 
of row i to row j and of b; to ру. It can be generalised for the addition of multiples 
of an equation and for subtraction of multiples of equations. 

Thus the Gaussian elimination method and the Gauss-Jordan elimination can 
be carried out by performing the transformations with the elements of the matrix of 
coefficients and with the corresponding elements of b. This can be done using matrix 
notation if we augment the matrix of coefficients A with the column vector b and 
transform this augmented matrix A|b according to the Gaussian or Gauss-Jordan 
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elimination. Then the first part A will be transformed into a unit matrix and the 
column b will be transformed into the column vector of solutions. This is more 
concise and reduces the chance of making errors. 


15.2.4 Existence of Solutions 


Number of Variables and Equations 


We know that from one equation we can only determine one unknown variable. If 
we have one equation and two variables, one of the variables can only be expressed 
in terms of the other. 

In order to determine n variables we need n equations. These equations must be 
linearly independent. An equation is linearly dependent if it can be expressed as 
a sum of multiples of the other equations. 

If we have n variables and m linearly independent equations (m < n), only m 
variables can be determined and n — m variables can be freely chosen. Let us ex- 
plain: in a system of m equations, (n — m) variables can be shifted to the RHS, 
m variables remain at the LHS. The Gauss-Jordan elimination can now be carried 
out, giving a solution for m variables. But this solution contains the n — m variables 
previously shifted to the RHS. Thus these are the freely chosen parameters. 

If m > n, the system is overdetermined. It is solvable only if m — n equations are 
linearly dependent. 


Existence of a Solution 


Let us consider a set of 7 linear equations containing n variables. If at any stage in 
the elimination procedure the coefficient a;; of a variable x; happens to be zero, 
the equation has to be changed for an equation whose coefficient of x; below the 
diagonal is Æ 0. If all coefficients of x; below the diagonal are zero too, the set has 
no unique solution or no solution at all. In this case, we proceed to the next variable 
and continue the elimination procedure. 

The set of equations has no unique solution if on the RHS of row j the value 
of bj is zero. This happens when the equation is linearly dependent on the other 
ones. The value of this variable is not determined and it is freely chosen. It should 
be added that if this happens r times we will have r variables freely chosen. 

This can be understood if we note that a row of zeros reduces the number of 
equations. In this case, the number of variables n exceeds the number of remaining 
equations (m = n — r) and, as has been stated above, n — m = r parameters аге 
freely chosen. 

The set has no solution at all if on the RHS of row j the value of 5; is not 
zero. In this case we have the equation 0 — 5;, which is impossible. Thus the set of 
equations contains contradictions and has no solution at all. 
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Solution of a Homogeneous Set of Linear Equations 


Consider a set of n linear equations with n variables. If all constants b; on the RHS 
are zero, we have what is referred to as a set of homogeneous linear equations. 
There is a trivial solution with 


x; —0, Jn 


A non-trivial solution may also exist. In this case, there must be at least one equa- 
tion linearly dependent on the others. Consequently, the solution is not unique and 
contains at least one parameter freely chosen. 


Example Given a set of linear equations 


Augmented matrix A |b 


4 
5 
4 8 
7 


We use matrix notation and carry out the transformations with the augmented matrix 
А|Ь. 


1. Division of the first row by ат and then elimination of the coefficients in the 
first column: Subtraction of row 1 from row 2, subtraction of row 1 multiplied 
by 2 from row 3, and addition of row 1 multiplied by 4 to row 4, gives 


1-2 0-1|-3 

0 3 3 6 15 

0 2 2 6| 14 

0 1 1 4 9 

2. Division of the second row by a22 and then elimination of the coefficients in the 
second column: Addition of row 2 multiplied by 2 to row 1, subtraction of row 


2 multiplied by 2 from row 3, and subtraction of row 2 from row 4 gives 


AAN 


3 
2 
2 
2 


ооо н 
оо н о 
оо н N 


3. In ће third column азз and all coefficients below the diagonal happen to be 
Zero. Thus we proceed to the fourth column. We divide the fourth row by a44 
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and eliminate the coefficients above. We obtain 


- о о о 


1 
1 
0 
2 


оо н м 


0 
1 
0 
0 


ооо н 


In the third row all elements аге zero. Thus the set has по unique solution. The value 
of x3 can be freely chosen and hence the values of x; and x2 depend on this choice. 


xı =1-2x3 
X2 =1-x3 
X4 —2 


Example Solve the following set of homogeneous linear equations. 


1 4-1 
4 16-4|х-0 
92:23:11 


Augmented matrix A|b 
1 4-1/0 
4 16-40 
2 -3 10 


1. Eliminating the coefficients in the first column gives 


1 4-10 
0 0 0/0 
0-11 3/0 


We see that the set has a non-trivial solution, since one row consists of zeros and 
is thus linearly dependent. 

2. Since a22 = 0, we interchange row 2 and row 3. Dividing the new diagonal 
element and eliminating the coefficient above the diagonal in the second col- 


umn gives 
1 0 400 
o 1-20 
11 
оо 0/0 


We are left with two equations for three variables. We write it down explicitly, shift 
the third variable to the RHS and obtain the solution: 


X1 = —1143 


X2 = 3 


—x 
11 
The variable x3 is freely chosen. Thus the solution is not unique: it contains one free 
parameter. 


438 15 Sets of Linear Equations; Determinants 


15.3 Determinants 


15.3.1 Preliminary Remarks on Determinants 


In this section on determinants we explain the concept and its properties. We 
give as an application the method of solving sets of linear equations known as 
Cramer's rule. 
We will introduce the concept of a determinant by means of an example. 
Consider two linear equations with two unknowns, хү and x2: 


аціх +412X2 = by ог (41412) (x1) bi 
а21Х1 + à22X2 = b2 421 422) \x2 b2 


These equations, when solved, give 


b1a22 — b2412 
X1 MÀ 
411422 — 412421 
роаџ — b1a21 
X2 = n 
411422 — 412421 
The solutions exist, provided that the denominators are not equal to zero. We notice 
that these denominators are the same for x, and x2. It is customary to express them 
as follows: 
а11 412 


= 411422 — 412421 
421 422 


This expression is called the determinant of the matrix A. If this determinant is 
different from zero, then unique solutions exist for x1 and x». 
The determinant is a prescription to assign a numerical value to a square matrix. 
For example, we can speak of the determinant of the 2 x 2 matrix A. There are 
several notations used in the literature: 


аа 
et ( "1! “1? | — det A = 
421 422 


The given formula applies only for the determinant of a 2 x 2 matrix. But the eval- 
uation of the determinant of a n x n matrix can be reduced successively to the eval- 
uation of determinants of 2 x 2 matrices. 

The solution of a set of two linear equations for x; and x2 can be expressed in 
terms of determinants: 


411 412 


= А = 011022 — 412421 
421 d22 


by a12 а11 bı 
M b2 аэ2 CS b2 
ie det A эж det A 


This is Cramer’s rule for two linear equations which will be dealt with generally in 
Sect. 15.3.4. 
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15.3.2 Definition and Properties of an n-Row Determinant 


Generally speaking, the determinant of a square matrix of order n (n rows and 
n columns) is referred to as an n-order determinant. Although the determinant is 
a prescription to assign one numerical value to a given square matrix consisting of 
n? elements, it is usual, before the numerical evaluation, to refer to elements, rows 
and columns of the determinant in the notation below. Nevertheless, it is essential 
to distinguish between a matrix, which is an array of numbers, and its determinant, 
which is a number. 


411412 ... Aik ... Gin 411412 ... Aik ... Ain 
det | аг а; ... Aik ... Gin | 5514 аә... Aik --- Gin 
ап] an2 ... Ank --- апп ап] 4и2 ... Ank --- апп 


With each element а; р is associated a minor found by omitting row i and column k. 
The minors are determinants with n — 1 rows and columns. 
The cofactor A;; is obtained by multiplying the minor of ад, by (—1) 
The procedure for evaluating the cofactor Д; р is shown below. 


itk 


Omit 


det A —||_ 1% » 


ithrow |А = (-1)** 


иштоо щу 


Finally, we define the expansion of the determinant by а row (or a column). It 
is defined by multiplying each element of the row (or column) by its cofactor and 
summing these products. 


Example We expand the given determinant by the first row. First we evaluate the 
cofactors of the first row: 


1 23 
det A —|3 21 
5-31 
Cofactor А: 
21 21 
Ay = (70! Bg a ee =2-(-3)=5 
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Cofactor А12: 


31 31 
A12 = (—1)!*? eal |= -1х3- (-)3=2 
Cofactor A13: 
A13 = (- 1) 24:14” 9-10--19 


Second, we multiply the cofactors by the elements a; and obtain the sum 
1х5+2х2+3(—19) = –48 


Without giving the proof, we state that the expansion of a determinant by different 
rows or columns always gives the same value. 


Evaluation of determinants 

The value of an n-order determinant is defined by the value of its expansion 
by any row or any column. 

Expanding by the ith row gives 


det A = а Aj, +а;2 А2 +--+: ап Ain 
Expanding by the kth column gives 


det A — ak Ak d doz Arp dd ask Ánk 


The value of the 3 x 3 determinant in the preceding example is thus given by the 
expansion which has already been obtained. 

The evaluation of a determinant with n rows and n columns is reduced to the 
evaluation of n determinants with (n — 1) rows and (n — 1) columns. Applying the 
rule again reduces it to determinants with (n — 2) rows and (n — 2) columns and so 
on until we are left with 2-row determinants. 

As a special case, it should be noted that the determinant of a diagonal matrix is 
given, up to sign, by the product of the diagonal elements. This follows if the given 
method is applied. 


Hints for the Expansion of Second- and Third-Order Determinants 


(a) Second-order determinants. The formula for the evaluation of a second-order 
determinant can be easily remembered with the help of the following scheme: 


aii _ y2 
tuit, 
he 
01-7 422 
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The value is given by the algebraic sum of the products formed by the elements 
on each of the two diagonals, the product taken downwards being positive, and 
that taken upwards being negative, i.e. 411422 — 421412. 

(b) Third-order determinant. We can establish a similar scheme for the expansion 
known as Sarrus' rule. 


Sarrus’ rule Repeating the first two columns of the determinant on the 
right, the expansion may be written down by taking the al- 
gebraic sum of the products formed by the elements on each 
of the six diagonals, as shown below; products taken down- 
wards are positive and products taken upwards are negative. 


Negative 
P 
ay an аз ап d 
Mae ce 
d an @3 ар d» 
ИЕ 
d3] 432 0433 3] а32 
ета 


Example Evaluate the determinant det А 


1 
- эм 
Wwe w 
| 
WN 


Solution | using the Sarrus’ rule: 


—2x1x443(-3)x1--5x2x3 
—]1x1x5—3(-3) x2—4x2x3 
-8-9-30-5-18-24--18 


Solution 2 using cofactors and expanding by the first column: 


2 3 5 
det A = |2 1-3 
134 
1-3 A 5 3 8 
— (_1)141 _4)2+1 _4)3+1 
=(-1) 2h j^ 1) 2. j^ 1) T 


= 2(44 9) - 2(12 — 15) + 1(—9— 5) 
= 26--6—14— 18 
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Properties of Determinants 


To evaluate determinants, in practice we frequently make use of the following prop- 


erties (considering third-order determinants only for simplicity) to simplify the 
working. 


Property 1 The value of the determinant is unaltered if columns and 
rows are interchanged (transposed). 


det A =det AT 


Since interchanging rows and columns does not affect the value of the determi- 
nant, any property established below for ‘rows’ also holds for ‘columns’. This will 
not again be mentioned explicitly. Thus 


411 412 413 411 421 431 
421 422 423| = |412 422 432 
431 432 433 413 423 433 


Property 2 If two rows of the determinant are interchanged, the absolute 
value of the determinant is unaltered, but its sign is changed. 


411 412 413 421 422 423 
a21 422 423| = — |а 412 413 (rows 1 and 2 are interchanged) 
431 432 433 431 432 433 


Property 3 If all the elements of one row of the determinant are mul- 
tiplied by a constant k, the new determinant is equal to 
kx (value of the original determinant). 


411 412 413 411 412 413 
det A = |kaz1 kaz2 kaoa| = k (21 a22 a23 
431 432 433 431 432 433 


If all the elements of the matrix are multiplied by a constant k the new determinant 
is equal to k” x (value of the original determinant). 


Property 4 If two rows of a determinant are identical, the value of the 
determinant is zero. This applies equally if two rows are pro- 
portional to each other. 
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Property 5 The value of a determinant is not altered by adding to the 
corresponding elements of any row the multiples of the ele- 
ments of any other row. 


411 412 413 a11 + kazı a12 + ka25 413--Ка23 k x (second row) 
a21 422 423| = a21 a22 a23 is added to first 
431 432 433 431 432 a33 TOW 


Property 6 If each element of any row is expressed as the sum of two 
numbers, the determinant can be expressed as the sum of 
two determinants whose remaining rows are unaltered. 


a11 +b1 412 +b2 a13 +b3| |а 412 аз bı b2 b3 
a21 an2 a23 |= |d21 a22 a23| + |a21 422 423 
a31 a32 азз 431 432 азз| |a31 432 433 


Property 7 If the elements of апу row are multiplied in order by the 
cofactors of the corresponding elements of another row, the 
sum of the products is zero. 


а11А21 + 412422 t d13423 = 0 


Using properties 2, 3, 5, any determinant can be transformed so that only diago- 
nal elements remain. The product of the diagonal elements is, except for the sign, 
the value of the determinant. This is equivalent to the Gauss—Jordan elimination. 
In practice, this method considerably reduces the amount of calculation involved in 
solving determinants of the fourth order and above. It should be noted that it is suf- 
ficient to eliminate the elements below the diagonal (Gaussian elimination), since 
the elimination of the elements above the diagonal does not affect the diagonal ele- 
ments. 

411412013) |P10 0 

421 422 423| = | 0 P2 0 | = Pi P2 P3 

431 432 433 0 0 P 


The following example illustrates the application of the above properties before 
expanding by a row or column. 


11 3 7 
Example Evaluate the determinant: |10 2 6 
5 1 4 


- 


Subtraction of row 2 from row 1 (property 5) gives |10 
5 


— N -e 
BRAK 
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1 1 1 
Subtraction of two times row 3 from row 2 gives |0 0 —2 
5 1 4 


1 1 1 
According to property 3, we can write —2}0 0 1 
5 1 4 


We interchange row 1 and row 2 (property 2) and evaluate the cofactor A}, obtain- 
ing 
0 0 1 
21 1 1=2(-4) = –8 
5 1 4 
The determinant can also be solved by transformation into a diagonal form. In this 
case we get 


1 0 0 
—-(42)-4)0 1 0|--8 
0.0 1 


15.3.3 Rank of a Determinant and Rank of a Matrix 


If det A Z 0, we define the rank r of an n order determinant as r — n. If det A — 0, 
the rank r is less than n. In this case, the rank of the determinant is defined by the 
order of the largest minor whose determinant does not vanish. Thus its rank r is m 
if a minor with m rows exists which is not zero, but all minors with more than m 
rows are zero. 

The rank of a square matrix is defined by the rank of its determinant. From am x 
n matrix, submatrices can be formed by deleting some of its rows or columns. The 
rank of a m x n matrix is the rank of the square matrix with the highest rank which 
can be formed. 


Example Evaluate the rank of the matrix and of its determinant. 


1). 2. dq 59, 
20 2 0 
det A = 1 ото 
De 9: 2 


It is not practical to evaluate the determinant of the matrix by calculating the minors 
as the calculation involved is rather tedious. We had better try to transform the de- 
terminant. If we subtract row 1 and row 3 from row 4, the latter becomes zero. If we 
subtract half of row 2 from row 3 the latter becomes zero. Hence 


1) 2 ae 9, 
2 0 2 0 
det A =|) ооо 
0 0 0 0 
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There are only minors of rank two which do not vanish. Thus the rank of the matrix 
and its determinant is 2. The same result is obtained if we notice that two pairs of 
columns of the original matrix are equal. 


15.3.4 Applications of Determinants 


Cramer’s Rule 


Cramer’s rule is a method for solving sets of linear algebraic equations using deter- 
minants. This method is of theoretical interest. In practice, it will only be feasible 
for sets of two or three equations. 

Given a set of equations in matrix notation 


411 -4441н Х1 bi 


an1 --. Ann Xn bn 


Let det A be the determinant of the matrix of coefficients A. If det A 5 0, the 
system has a unique solution. 

Let A be a matrix which is obtained by replacing in the matrix of coefficients 
the kth column by the column vector b. The solution is then given by 


det A‘) 
= k —1,2,3,... 
Xk det A ( , , , ,n) 
We will refrain from giving the proof. Although it is straightforward it is quite te- 


dious. 


Cramer's rule Given a set of linear algebraic equations Ax = b, the solu- 
tion is 


— de AU 


== = 1,2, 3,02 
Xk det A ( , ‚3, ,n) 


det A) is generated from det A by replacing the column of 
coefficients а; of the variable x, by the column vector b. 


Regarding Cramer's rule, we can draw some conclusions about the existence 
of a solution which are obvious and plausible and have already been stated in 
Sect. 15.2.4. 


(a) The case of a non-homogeneous set of n linear equations with n unknowns. If 
det A — 0, then Cramer's rule cannot be applied. Such a set of equations has 
either an infinite number of solutions or none at all. In this situation, the concept 
of the rank of a determinant is of great value. 
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(i) If det A is of rank r « n and any of the determinants det A (9) are of rank 
greater than r, then no solution exists. 

(ii) If det A is of rank r < n and none of the det A have a rank greater 
than r, then there is an infinite number of solutions. 


(b) The case of a set of homogeneous linear equations (5 — 0). 


() This set of linear equations always has the trivial solution 
X] —X5 SS K=O: 
(ii) A non-trivial solution exists if and only if the rank r of the matrix A is less 
than n, і.е. г <n. 
(11) A homogeneous set of equations with m independent equations and n un- 
knowns has a solution which differs from zero if n > m. The solution con- 
tains (n — m) arbitrary parameters. 


Example Consider the following set of non-homogeneous equations: 


X140 x2+x3 =$ 


3x1 +2x2 + x3 = 49 


5x1 — 3x2 + X3 =0 


It can be written in matrix notation thus: 


1 1 X1 8 
з 2 1| [х |= [49 
5-3 X3 0 
We can calculate the determinants: 
1 1 1 8 1 1 
detA=|3 2 1|—2-12 det AU — 49 2 1|2 —156 
5—3 |] 0—3 1 
1 8 1 1 1 8 
det A2 = |3 49 1/--180 det A9 – |3 2 49| —240 
5 0-1 5-4 0 


From Cramer’s rule, the solution is 
x1—13, x2=15, хХз--20 
Example Consider now the following set of non-homogeneous equations: 


xı +2x2+ 3x3 = 4 
3x, = 7х2 + x3 = 13 
4х1 m 8x2 = 12x3 L2 


It can be written in matrix notation thus: 
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1 2 3\ x 4 
3-7 1) | x.) 5 [ 13 
4 812) \x2 2 


We can calculate the determinant: 
I 2 3 
det A = |3 -7 1|=0 
4 812 
According to the above statement, this set of equations has either no unique solu- 


tion or no solution at all. To decide which is the case, we use the Gauss-Jordan 
elimination and obtain, after the first step, 


1 2 3 4 
0 —13 —8 x= 1 
0 0 0 -14 
The last equation 0 = —14 is impossible. Thus the system has no solution at ай. The 


same result follows if we look at the rank of the determinant of A. The rank is 2. 
Since the rank of det A“) is 3, there 1s no solution at all. 


Example Consider again the set of homogeneous linear equations given in the Ex- 
ample on p. 437. 


1 4-1 
4 16-41х-0 
2 —3 1 


The first and second equation differ only by the factor 4. Hence the equations are 
linearly dependent and 


1 4-1 1 4-1 
4 16-4-01 0 0-0 
23d 2-3 1 


Thus a non-trivial solution exists. Rewriting the first and third equations gives 


хі + 4x2 = x3 


2х1 — 3x2 = —X3 
From Cramer’s rule, we have 
X3 4 1 xs 
Ea um NE 12 =х3|  3xs 
“a= Meg арт а 
2-3 2-3 


Hence we see that, as before, the solution contains one arbitrary parameter. 


448 15 Sets of Linear Equations; Determinants 


Vector Product in Determinant Notation 


In Chap. 2, Sect. 2.2.7, we defined the vector product of two vectors 
а = (ах, ау, az) and = (bx, by, bz) as 


axb=i(aybz —azby)+ j (azbx — axbz) - k(axby —aybx) 


If we regard the expressions in brackets as two-row determinants, the RHS of the 
equation can be looked at as the evaluation of a three-row determinant: 


i j k 
a x b = jax dy az 
bx by bz 
From the properties of determinants it follows that 
ахф--д ха, since 
i j К i j k 
ах ау dz | = — bx by bz 
bx by bz ах dy az 


Volume of a parallelepiped 


Consider the parallelepiped defined by the three vectors a, b, and c (Fig. 15.1). 
From Chap. 2 we know that the value of the vector product z — a x b represents the 
area of the base. Furthermore, z is a vector rectangular to the base. 


Fig. 15.1 


The projection of c on to z represents the height of the parallelepiped. Thus the 
volume is 
V = |e -z| = |с(а x b)| 


Written as components: 


V = |cx(aybz —azby) 4- cy (azbx — axbz) + ez(axby —aybx)| 
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This can be expressed as a determinant (up to sign): 


Cx Cy Cz ах dy dz 
V= ах ау 4 | = by by bz 
bx by bz Cx Cy Cz 


Note that the sign of the determinant is positive if a, b and c are oriented according 
to the right-hand screw rule (see Chap. 2). 


Exercises 


15.2 Sets of Linear Equations 


1. Solve the following equations using either Gaussian or Gauss-Jordan elim- 
ination. Use matrix notation. 


2x1 + xX2+5x3 = —21 х= y+3z=4 
(a) xı +5x2 +2x3 = 19 (b) 23x+2y+4z = 13 
5x1 +2x2 + х3 = 2 11.5х+ y+2z = 6.5 
xı + х2 + х3 = 8 1.2x — 0.9у + 1.52 = 2.4 
(с) 3х +2x2 + х3 = 49 (d) 0.8х—0.5у + 2.52 = 1.8 
5x1 — 3x2 + x3 = 0 1.6x—1.2y+ 27 = 3.2, 
2. Obtain the inverse of the following matrices: 
21 0 
(a) [11-2 
03-4 


Jer 


3. Investigate the following sets of homogeneous equations and obtain their 


solutions. 
xı + Х2-33-0 2x— 3y+ z=0 
(a) —X1 +3x2 4- x34 = 0 (b) 4x4 4y— 1-0 
хэ + хз = 0 х-3у-4:-0 


2 
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15.3 Determinants 


4. Evaluate the following determinants: 


4 3 2 1 7 4 12 
(а) |1 0-1 5 5 4 3 
5 2 2 (b) -2 6 25 3 
3402 5 35 20 60 
(c) 6 1-3 1 4 60 7 
lo rae A 20 @ | =3 9 2.8 
5-1 2 4 10 1 0 2 
102 3 5 2 0 1 
2 1 8 5 
© | o 0-4-2 
1 O 1 4 
5. Determine the rank r of 
(a) A= 2-2-2 0 (b) B= 
0202 3 1 3 1 
2 "qm. 


6. Find out whether the sets of linear equations given in question 1 are uniquely 
solvable by examination of the determinant of the matrix of coefficients. 


Chapter 16 
Eigenvalues and Eigenvectors of Real Matrices 


16.1 Two Case Studies: Eigenvalues of 2 x 2 Matrices 


In Chap. 14 it was shown how a matrix A and a vector r can be multiplied to give 
a new vector r' (provided the dimensions of the vector and the matrix fit): 


r' = Ar 


Let us remember that each row of A is to be multiplied with r, which is thought to 
be a column vector. As an example, we will consider a 2 x 2 matrix A and a 2-row 
vector м; multiplication results in a new 2-row vector r’. 


0.50 1 
IfA= Ё J and r = (1) then 


EDO- 


Figure 16.1 shows both the old vector r and the new vector r’. The result of applying 
A to r can be described as reducing the x-component by half and doubling the y- 
component. 


Fig. 16.1 
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Generally, the new vector r' and the old vector r will point in different directions. 
However, there are some special vectors whose direction does not change when A 
is applied. 

If r points along either axis, then for the matrix A under consideration the cor- 
responding vector r’ will point in the same direction (Fig. 16.2). 


Fig. 16.2 


To give an example, 


if "-(15) then 251 (14|-(1) ан 


Instead of applying A to this special vector we could simply multiply rı by the 
scalar 2. This is, of course, by no means true for any vector. Therefore a special 
nomenclature has been introduced. 


Definition Given an n x n matrix A and an n-vector r, if r' = Ar points 
in the same direction as ғ, i.e. r’ = Ar where А is a real scalar, 
then r is called an eigenvector of A with real eigenvalue A. The 
cases r = 0 or A = 0 are excluded from this definition. 


The last example could thus be rephrased as follows. The vector r is an eigen- 
vector of А and the corresponding eigenvalue A; = 2. In this case, there is also 


a second eigenvector, e.g. r2 = , with eigenvalue Л2 = 0.5. Thus the matrix A 


1 

0 

possesses two real eigenvalues and we have found two corresponding eigenvectors. 
Three questions now arise: 


1. What is the maximum number of real eigenvalues and eigenvectors for a given 
matrix? 

2. Does every matrix possess real eigenvalues and eigenvectors? 

3. How can these real eigenvalues and eigenvectors be computed? 
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We will restrict our examples to the case of 2 x 2 and 3 x 3 matrices, and, before 
discussing generalities, we should look at a second, slightly less trivial, case. 


1.25 0.75 
0.75 1.25 
Clearly, vectors pointing in the direction of an axis do not solve this problem. We 
could embark on a trial and error search. But that could be tedious because real 
eigenvalues might not exist! 

Therefore, let us start by reformulating the problem. We wish to find a number A 
and a vector r such that 


Example For A = ( ) find the eigenvalues and eigenvectors. 


Ar = Ағ (16.1) 


Let us write this down as a set of two equations for the x and у components of r: 
1.25 0.75 x 
с te [3 in () 


1.25x+0.75y = Ax 
0.75x + 1.25y = Ху 


The equations are 


By subtracting the RHS, a homogeneous set of two linear equations is obtained: 
(1.25—A)x+0.75y = 0 
0.75x + (1.25— 4)y = 0 (16.2) 


By definition, the trivial solution x — y — 0 does not interest us. Are there any non- 
trivial solutions? We know from Chap. 15 that these indeed exist, if the determinant 
of the coefficients vanishes: 


(1.25—A)? —0.75? 20 (16.3) 
This is a quadratic equation in A, and there are two distinct real roots: 
Àà1—22, A2=0.5 


The computed values are the only candidates for the eigenvalues of A. Inserting 
them one after the other into the set of (16.2) in fact gives the following solutions: 


ForA, r= (i) 
1 
For à? рә = (1) 
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Fig. 16.3 


(Any scalar multiple would naturally do as well.) For the sake of clarity, we will 
check explicitly that these vectors do satisfy (16.1) with A = A, and A = Ag, respec- 
tively (Fig. 16.3): 


jl ge. 025 005 LV. о ordi 
ган te i2) (1) Е (2) | 

Хун дэл (ТЭЭ ELS p 05 1 
Cid Ge ЕЭ E Е (202) a Ё! 


Let us recapitulate. There are two eigenvalues for A and for each of them an eigen- 
vector has been found. The eigenvalues were obtained as the roots of (16.3). That 
equation deserves some attention. It is called the characteristic equation of A. 

We know well that polynomial equations need not have any real roots, and, in 
general, some roots are complex and some are real. There are, at most, as many 
real roots as is the degree of the equation; in particular, a 2 x 2 matrix has, at most, 
two real eigenvalues. (Consider for example, the matrix given in question 3 of the 
exercise at the end of this chapter.) Also, any 2 x 2 matrix describing a rotation about 
an angle a Æ 0 or л, has, evidently, no real eigenvalues. 

Please note that we are dealing throughout this book with real matrices and real 
vectors, i.e. all entries must be real numbers. Accordingly, it would not be suitable 
to use complex scalars for multiplying vectors, and we do not consider complex 
eigenvalues. But you should be aware that in other situations it may be quite useful, 
or even unavoidable, to use the complex values. 


16.2 General Method for Finding Eigenvalues 


In order to find a general procedure for obtaining all eigenvalues and eigenvectors of 
a given matrix A, we will retrace the steps taken in the preceding section; however 
we will employ a somewhat more abstract notation. 
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Given a square n x n matrix A, we want to find all real eigenvalues of A (up to 
n distinct values) and an eigenvector for each of them. 
Equation 16.1 still describes the general situation correctly: 


Ar — Ar 
Let us insert a unit matrix J on ће RHS: 
Ar = Ме 
As before, ће RHS is subtracted: 
(A — AI)r =0 
This is again a set of linear equations and the condition for finding non-trivial solu- 


tions is that the determinant should vanish. 


Theorem 1 For the real scalar А to be an eigenvalue of the matrix A it must 
be a real root of the characteristic equation: 


det(A — AI) = 0 (16.4) 


This is a polynomial equation of degree n if A is an n x n matrix. 


For convenience, we give the explicit forms of the characteristic equation for 


dimensions 2 and 3: 
аа 
FEE 11 412 
421 422 


If 
then the characteristic equation is 


2 
À* — (a11 +.22)A + 411422 — 412821 =0 (16.5) 
If 
411 412 413 
A = | a21 422 423 
431 432 433 


then the characteristic equation is 


=A? + (a11 +a22 База) ДР 
— (411422 + 411433 + 422433 — 412421 — 413431 — A23032)A 
+ detA =0 (16.6) 


For a square matrix of any dimension n the characteristic polynomial starts with 
(—1)"A" + (—1)^71 A"71 (a11 +a22 +++» + ann) and it always ends with +det A. 
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The first non-obvious coefficient is always the sum of the entries along the main 
diagonal of A. It is called the trace of A: 


tr(A) = ај +422 +азз+::: +аһп 


After determining all real roots of the characteristic polynomial, we proceed to solve 
the homogeneous systems of linear equations in order to find eigenvectors. 


16.3 Worked Example: Eigenvalues of a 3 x 3 Matrix 


This section goes step by step through the details of finding the eigenvalues and 
eigenvectors of a given 3 x 3 matrix. 


21 3 
A= |12 3 

3 3 20 

1. Find the characteristic equation. Its RHS = 0 and its LHS is given by the deter- 
minant 
2-A 1 3 
det{ 1 2-A 3 |--42-2412-651-42-0 
3 3 20-A 


2. Find the roots of the characteristic equation. This means solving a cubic equa- 
tion — and we could try any of several approaches to this problem. 
We can (a) use numerical methods; (b) refer to Cardan’s formulae for third order 
equations to find the solutions explicitly; (c) try to guess a first solution Ау and 
then divide the cubic polynomial by (A — А1) in order to obtain a quadratic 
polynomial. 


For the given matrix A, we use the third approach. It is not hard to see that Ay = 1 
is a root. Therefore, we can split off the linear factor (A — 1) and the characteristic 
polynomial can be written thus: 


—A3 - 244? — 654 +42 = (А — 1)(-А2 + 23А — 42) 20 


In order to find the two other eigenvalues, if the roots are real, we solve the quadratic 
equation 
А2 —23A 42-0 


Its solutions are 


2 
2 2 2 19 
À2,3 = 5 z ( 2) 42 = T 
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Now we know that there are, in fact, three distinct real eigenvalues of the given 
matrix A: These are 


Ар-1, Л = 2 апа Аз = 21 


3. For each eigenvalue A; we must now find a non-trivial solution ғ; of the respec- 
tive homogeneous sets of linear equations 


(A — AjD)ri =0 


The vectors obtained will be eigenvectors of the matrix A to the respective eigen- 
value Àj. 
When À — 1. Set to solve: 


3x1 = Зуу т 1921 = 


A particular and non-trivial solution is obtained if 2] = 0 and hence x, = — y1. We 
can put, for example, x; = 1, y; = —1. Then the vector found is 
Fi = -1 
0 


It is an eigenvector of A with eigenvalue 1. 
When Л = 2. Set to solve: 


0x5 “ГР 1у2 +r 3z5 —0 
1x3 +0y2 T: 3z5 —0 


3x2 + 3y2 + 1822 = 0 


The third equation can be seen to be linearly dependent on the two other equations, 
so we can multiply each one of the two first equations by 3 and add. We need only 
consider the first two equations: 


y2+3z2 = 0 


хо 322 = 0 


They give x2 = y2 = — 322. A particular solution is obtained by, e.g. letting 22 = — 1; 
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It is an eigenvector of A with eigenvalue 2. 
When À = 21. Set to solve: 


—19x3 T lys + 323 = 0 


1x3 4 19y3 ae 323 -0 


3x3 B Зуз = 123 = 0 


Again, the third equation can be seen to be linearly dependent оп the two other 
equations, so we can add the first two equations and divide by —6. We need only 
consider the first two equations. 

They give бхз = буз = 23. 

A particular solution is obtained by, e.g. letting z3 = 6: 


r3 = 1 
6 


It is an eigenvector of A with eigenvalue 21. 
The problem of finding the eigenvalues and eigenvectors of the given matrix A 
is thereby solved exhaustively. 


16.4 Important Facts on Eigenvalues and Eigenvectors 


The matrix A in the preceding section was chosen deliberately. It is symmetric, i.e. 
it equals its own transpose. It seems we were lucky in being confronted with a ma- 
trix which duly possesses three real eigenvalues and corresponding eigenvectors. 
But that was not a coincidence; it illustrates the following theorem, which we shall 
not prove. 


Theorem 2 A real non-singular symmetric n x n matrix possesses n real 
eigenvalues. Corresponding eigenvectors can be found such that each of them 
is orthogonal to each one of the others. 


You should not find it too difficult to verify the second half of the assertion for 
the case of the matrix A. We can now answer the three questions posed in Sect. 16.1 
more explicitly. (We assume that the matrix is non-singular.) 


1. The maximum number of real eigenvalues and eigenvectors of a given n x n 
matrix is n. If the matrix happens to be symmetric, this maximum is attained. 

2. Thefollowing statement is relevant only for the case of non-symmetric matrices. 
If n is even there may be no real eigenvalues at all of a given n x n matrix. 
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If n is odd there must be at least one real eigenvalue of a given matrix, since the 
characteristic polynomial is of odd degree. 

3. Eigenvalues are found by solving the characteristic equation (Eq. 16.4). Eigen- 
vectors are determined by finding a non-trivial particular solution of the result- 
ing set of homogeneous linear equations. Please remember that the values A = 0 
and r = 0, respectively, are not admitted. 


Exercises 


4 
1. (a) For A = 13 
(b) In a diagram draw two corresponding eigenvectors. 


3) find the eigenvalues. 


2. Is it possible for a real 2 x 2 matrix to have one real and one complex eigenvalue? 


3. Prove that there are no real eigenvalues of the matrix 


iet 


-1-11 
4. (а) For A = | —4 24] find all the eigenvalues. 
-1 15 


(Hint: they are all integers.) 
(b) Find corresponding eigenvectors. 


5. In certain, rare, cases finding suitable eigenvectors may prove difficult. In order 
to illustrate what might happen, find the roots of the characteristic equation for 


«01 


Then try to find corresponding eigenvectors. 


Chapter 17 


Vector Analysis: Surface Integrals, Divergence, 
Curl and Potential 


17.1 Flow of a Vector Field Through a Surface Element 


Consider a steady flow of water through a pipe. The water is assumed to be incom- 
pressible, i.e. it has a uniform density (for which we will use the symbol p), the 
velocity of each particle having a constant value v = ds /dt. 

Thus, by assigning each point inside the pipe the velocity of the particle of water 
at that point, we are confronted with a field of vectors. See Fig. 17.1. 

For a start, let us simplify the discussion by stipulating that the velocity is con- 
stant, i.e. it has the same magnitude and the same direction at all points. In this case 
we call the vector field homogeneous. 

Now imagine a small plane rectangular frame that is placed perpendicular to 
the flow of water (Fig. 17.2). We wish to determine the amount of water passing 
through the enclosed area A during a given time interval At. Evidently, this will 
be determined by the water that is contained in the cuboid defined by A and the 
extension As. This extension is given by 


As = v- At 


So the volume is 
V=A-v-At 


The mass of the water passing through the area in the given time will be 


AM =p-V=p-AvAt 


Fig. 17.1 Fig. 17.2 Fig. 17.3 
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A characteristic value for the flow is the mass of water passing through a unit area 
per time unit. It is given by the quotient 


AM | 
AAt 
This value is called the flow density j . It is a vector since the velocity is a vector. 


p v=j 


Definition The expression j = p- v is called the flow density. 
Its absolute value is given by the amount of water passing 
a unit area per time unit. 
(The area is assumed to be perpendicular to the velocity v.) 
The vector j is parallel to v. (17.1) 


If we consider an arbitrary plane area A perpendicular to the direction of flow, 
the total flow is given by 
Г-А4Д-аг) 


Next, we assume the plane area A to be in a general position, i.e. its normal and 
the direction of flow include an arbitrary angle o. Figure 17.4 shows us that the 
projection area perpendicular to the flow, which we will denote by А у, is given by 


A; = Acosa 


As the flow density is assumed to be a constant, the mass of water passing through an 
arbitrary plane A area evidently equals the mass passing through its projection A ;. 
So the flow 7 passing through A is given by 


I= j- Aj = jAcosa 


This equation formally resembles the scalar product of two vectors j and A, which 
is yet to be defined. 

We next introduce the concept of a surface element vector by taking the orienta- 
tion of the surface element into account. 


Definition The surface element vector A of a plane element A is given by 
the vector perpendicular (i.e. normal) to A with magnitude 


|A|- 4 (17.2) 
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The sign of A will be determined by convention. In our case it is convenient to 
choose it in a way that A points along the direction of the flow. 


Example Consider a square element A that is part of the x-z plane as shown in 
Fig. 17.5. It will be assigned the vector 


A = A(0,1,0) 


Fig. 17.5 


Example We now slide the base of the square element along the y-axis, so that it 
defines a 45° angle to the x-y plane as shown in Fig. 17.6. Now its surface element 


vector is given by 


A 
A= 550011) 


х 


Fig. 17.6 


So we are in a position to describe the flow 7 through an arbitrary plane surface 
element A as a scalar product (also called dot product): 


1-)-А 


Let us generalize this concept by looking at any homogeneous vector field 
Е (x,y,z) and define its flow through any plane surface element: 


Definition Given a plane surface element A and a homogeneous vector 
field F, the flow of F through A is defined to be the scalar 
product of F and the surface element vector A: 


Е.А = Flow of F through A (17.3) 
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Please note: The flow of a vector field through a surface A is sometimes also 
called the flux across A. Unfortunately, the usage of this term is ambiguous, since 
some authors refer to the flux as the flow per time (e.g. in fluid dynamics), while 
others use it for the flow itself (e.g. in electrodynamics). In order to avoid confusion, 
we will not use the term flux in our book. 


17.2 Surface Integral 


The definition 17.3 of the flow of a vector field through a surface imposes two 
restrictions: 


1. The vector field is supposed to be homogeneous. 
2. The surface element A is thought to be plane. 


We will now drop these two restrictions. Let us consider arbitrary vector fields 
and curved surfaces. 

From Chap. 12 on Functions of Several Variables we already know that a func- 
tion of two real variables generally defines a curved surface in three-dimensional 
space. 


Example A hemisphere on top of the x-y plane is given by the function 


z=+1/R2—x2— y? 


How can we compute the flow of any vector field F through a curved surface A? 

A good approximation can be obtained as follows: By dissecting the surface A 
into sufficiently small elements AA;, we may assume these small elements to be al- 
most plane. Thus we can work with surface element vectors AA;, with |AA;| = A4;. 
Furthermore, for each AA; we may assume the vector field F to be homogeneous 
on that small element. See Fig. 17.8 and 17.9. 
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So, approximately, the flow of the 
vector field F through the element 
AA; is given by 


F (xj, yi,21) AA; 


We have assigned indices i to the 
arguments x, y, and z of F. That 
means that we compute the value 
of the vector field for a point 
(xi, yj, zi) on the element AA;. 


Fig. 17.9 


The total flow of the vector field F through the surface A is obtained by adding 
the flows through all the elements AA;: 


n 
Flow of F through А : = У. Е (xi, yi,zi): AAi 
i=1 
Refining the dissection by using ever smaller elements AA; will increase accuracy. 


Going to the limit n — oe will yield the exact value, which is called the surface 
integral, written as 


I, F(x,y,z): dA = Flow of F through A 


Definition The surface integral of Е (x, y, z) on the surface A (also called 
the flow of F through A) is given by: 


n 
ЇР dA = lim Y, F (x; yr zi) AA; (17.4) 
nS 


Applications of surface integrals in physics and engineering often deal with 
closed surfaces, i.e. the flow of a vector field through a closed surface. 

Definition A closed surface divides space into two disjoint regions, 
so that any continuous path from one region to the other 
must cross the surface (at least once). (17.5) 
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Examples of closed surfaces are: 


* the surface of a cube 

* the surface of a sphere 

* the surface of an ellipsoid 

* the surface of a torus (shape of a donut) 


The surface integral across a closed surface is usually denoted by adding a small 
circle to the integral sign. As was already mentioned, the sign of a surface element 
vector is determined by convention. So we now stipulate that dA is pointing away 
from the inner region. 


Definition Flow of F through a closed surface: 
j Ғ-ДА (17.6) 
dA points into the outward direction. 
Please note that the notation of the integral is not unique, as 


some authors denote the surface integral by a symbol for the 
surface beneath the integral sign, e.g. [4 ... 


dA 
Fig. 17.10 


In the case of a fluid streaming through a closed surface, the flow has a direct 
meaning. It tells us whether more fluid is flowing into or out of the inner region. 


17.3 Special Cases of Surface Integrals 


17.3.1 Flow of a Homogeneous Vector Field Through a Cuboid 


Consider a homogeneous vector field F = (Fy, F. у, Ё, =). The components Fy, F. y 
and Fz are constants (see Fig. 17.11). 

In order to compute the flow of F through the cuboid, we treat each of its six 
faces separately. 
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By choosing the coordinate system 
conveniently, as was done in Fig. 17.12, 
we obtain the following surface vec- 


tors: 
A1 = ab(0,0,1) 
Аз = ab(0,0, —1) 
Аз ET 
A4 = ac(0, 1,0) 
As — (1,0,0) 


Ag = bc(—1,0,0) Fig. 17.11 


In that special case we do not need to 
integrate at all, since the surface inte- 
gral of a homogeneous vector field F 
on a plane surface A is just the dot 
product of F and A. 


Fig. 17.12 
So we determine the six partial flows directly. 
F.A, =ab-F, 
F - A2 = —ab- Fz 
F .Аз = ас: Fy 
Е .Ад = ~ac- Fy 
F -As = bc- Fy 


F. A6 = —bc. Е, 
The total flow through the cuboid then is the sum of its six parts. We thus realize 


that the flow of a homogeneous vector field F through a cuboid always vanishes. 


6 
Total flow = Ж Е-А,-0 
1-1 
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Rule The flow of a homogeneous vector field F through a cuboid van- 
ishes. We can even generalize this result: 
The flow of a homogeneous vector field F through any closed surface 
amounts to zero. (17.7) 


Fig. 17.13 


The reasoning for the generalized statement will be given by plausibility. 

Given any closed surface, let us approximate the region contained therein by 
small cuboids. Figure 17.13 shows an arbitrary surface and one such cuboid. 

As we have just seen for any cuboid the flow of a homogeneous vector field 
vanishes. 

Looking at two adjacent cuboids we observe that the total flow must vanish also. 
As the two surface vectors at their common bounding areas differ by their sign only, 
the overall flow is determined by just the outer bounding areas of the two cuboids. 

By iterating that argument we find that for any set of cuboids, conveniently ap- 
proximating the given surface, the flow must evaluate to zero. 

Looking back to the constant stream of water this result is obvious: the same 
amount that is going into the volume V must come out again. 


17.3.2 Flow of a Spherically Symmetrical Field Through a Sphere 


A field that possesses spherical symmetry (Fig. 17.14) can generally be expressed 
as follows 
Е =e,- f(r) 
er is the unit vector pointing into radial direction: 
r 
er = — 


Ir| 


We assume that the center of the sphere coincides with the origin of the coordinate 
system (Fig. 17.14). 
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Fig. 17.14 


As all the surface element vectors dA are normal to the sphere, they always point 
into the same direction as r. Therefore, the surface integral can be simplified: 


fF .dA = {еа = {ал 
A A A 


The integration is to be performed over the complete spherical surface with the 
radius R. Since the integrand f (r) only depends on r, we may replace r = R in the 
expression f (r). It therefore is a constant and can be factored: 


f fena = d rada = ra) фал 
A A A 
The remaining integral is well known, it is the surface area of a sphere with radius R 


j dA = 4x R? 
A 


We have thus found the following rule: 


Rule The flow of a field with spherical symmetry F = e; f(r) through 
a spherical surface with radius R is given by 


jr -dA = Ax R? f (R) (17.8) 
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17.3.3 Application: The Electrical Field of a Point Charge 


A point charge Q at the origin of the coordinate system gives rise to an electric field. 


fo r=WVx2+y2422 


Fig. 17.15 


This field obviously is spherical symmetric, and we wish to compute its flow through 
a sphere with radius R. We can make use of rule 17.8: 


fF .dA = Ax R? f (R) 
In the particular case we know that F = E (x, y, z) which leads to 
f E -dA = Q 
£0 
This result tells us that the flow of the electric field generated by a point charge is 
independent of the radius R of the sphere. 

This relation, incidentally, holds true for any closed surface surrounding the point 
charge. It can also be generalized to any number of charges distributed inside the 
closed surface. It is then called the Gaussian law, one of the fundamental equations 
describing electromagnetic phenomena. Its name honors Carl Friedrich Gauss, one 


of the most prolific German mathematicians. His name will be mentioned again very 
soon. 


17.4 General Case of Computing Surface Integrals 


We are given a surface integral 


F -dA 
(Роу) 
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By explicitly writing down the dot product, it can be expanded into the sum of three 
integrals: 


(осу) аа = [ [Fedde + FydAy + РВА 


(х,у) 


Fig. 17.16 


That leads to two questions: 


1. How do we determine the components dA,,dA,, and аА; of the differential 
surface vector dA? 
2. How, for any given surface A, do we take the bounds of integration into account? 


Let us start with question 1: Recalling Chap. 1 on Vector Algebra we know that 
any vector r in three-space can be represented as the sum of its components along 
the three axes, each component being a multiple of the base vectors e x,e y, and ez: 


г = хех t yey d zez 


Fig. 17.17 
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Now, what are the base vectors for a finite surface element A? 
Figure 17.18 shows the three unit vectors for the surface element vectors: The 
unit vector in x-direction, for instance, represents a unit square in the y-z plane. 


x SN 
-—-— À— 
y y 
x 1 
X 
Fig. 17.18 


Generally speaking, the components of a surface vector A, Ax, Ay, and Az, re- 
spectively, represent surface elements in the y-z plane, in the x-z plane and in the 
x-y plane, respectively. The components are just the projections of the surface A 
into the appropriate plane, as shown in Fig. 17.19. 


Fig. 17.19 


By looking at ever smaller surface elements we obtain for the components 
dA, = dydz dA, — dxdz dAz — dxdy 


The surfaces that are perpendicular to these vectors are no longer unit squares but 
differential elements dydz,dxdz, and dxdy. 


ах, 


Fig. 17.20 
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We can thus express the differential surface element as follows 
dA = (dydz, dxdz, dxdy) 


Question | being answered; let us turn to the task of determining the boundaries 
for integration. The integral is already known to be a sum of three expressions: 


| ваа z [Feds + FydAy + Fed Az] 
Let us look at the last summand 
fe аА; = | xay 


What is the suitable range for the x and y values? It must be the projection of A 
into the x-y plane, which we denote by Axy (Fig. 17.21). 


Fig. 17.21 


Thus we obtain a double integral 


| oda 


Locally, we assume that for the surface A the z coordinate can be described as 
a function of x and y; z = f (x, y). Inserting this into the expression for Fz we get 


Fr(x,y, f (x,y)) 


Likewise, we assume that locally we can express x = g(y,z) and y = h(x, y). 
Inserting all this into the surface integral results in: 


| сосулаА- |, F(= g0,2),yz)dydz 
А Ayz 
«f, Fy(x,y =h(x,z),z)dxdy 


+ | Fayz =f (x,y))dxdy 
xy 
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zh 


Fig. 17.22 


Example Given is the nonhomogeneous vector field F = (0, 0, y) (Fig. 17.23). 


Fig. 17.23 


We compute the flow through the rectangular area in the x-y plane limited by the 
origin and the points 


Ру = (4,0,0) 
P2 = (0,5,0) 
Рз = (a,b,0) 


We must evaluate the integral 


a b .p? 
LETENI ! y-dxdy =" 
x=0/y=0 2 


This means that the flow increases in a linear fashion as the surface is extended in the 
x-direction, and it obeys a square law as the surface is extended in the y-direction. 


17.5 Divergence of a Vector Field 475 


17.5 Divergence of a Vector Field 


In the previous sections we were confronted with the following question: Given 
a closed surface A, how strongly does a vector field F “flow” through A? The 
answer was given by the surface integral as defined in (17.4). 

This concept is particularly important in the theory of electricity and magnetism. 
Assume that a closed surface contains a stationary charge density p which, inciden- 
tally, is defined by the quotient of charge per volume: p = dQ /dV. 

Field lines emerge from positive charges and they end at negative charges. Thus 
positive charges arc sources of the field, and negative charges are sinks. 


Example If the surface A encloses a positive charge density, the flow of the electric 
field E through the surface is proportional to the total charge Q according to the 


law 
[ваа = © 
£0 


Fig. 17.24 


This generalizes the result obtained in Sect. 17.3.3, where a point charge inside 
a sphere was discussed. 


Now let us proceed to something new: can we assign a meaning to the quotient of 
the flow through the bounding surface divided by the volume of the enclosed space? 


opr 
V 


Physically speaking, this expression must denote the average density of sources (or 
sinks, in the negative case) contained in the volume V. 

If we consider the limit of ever smaller volumes V — 0 containing just the point 
P, we will call this the divergence of the vector field F at the point P. It is denoted 
by div F. 


1 
div F = lim — Е-4А 
v=o V Ay) 
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The divergence describes in a unique manner, whether P belongs to the sources 
of F, in which case div F > 0, or whether it belongs to the sinks of F, in which 
case div F < 0. 

A vanishing value of div F indicates that the field contains neither sources nor 
sinks. 

Having a good grasp of the concept, we now must arrive at a practical means to 
compute the divergence. 

Let us consider a small cuboid of dimensions Ax, Ay, Az as shown in Fig. 17.25. 


diu cun y f Fa 
уо V 


The surface integral is given by the sum of the flows through the six faces, and 
on each one the vector field will be constant, approximately. 


24 К 
F (x, у, Z + AZ) 


F (x, y + ay, 2) 


Fig. 17.25 


As the components of F are parallel to the surface element vectors, we get: 


ofr ~ ау "ns — — —LFx (x - Ax, у, 2) – Fx(x, y, z))AyAz 
+[Fy(x, y - Ay, z) – Р(х, у, z))AxAz 

[Е (х, y, z c Az,z) – Fz(x, y, z)|AxAy} 
Fx(x - Ax, y, z) – Fx(x, y, Z) 


Ax 
Py, Y t Ay, z) - Fy(x, y, z) 
Ay 
Fz(x, y, z - Az) — Fz(x, y, Z) 
Hu Az 


In the limit V — 0 i.e. Ax — 0, Ay — 0, Az — 0, we end up with three partial 
differentials. 


17.5 Divergence of a Vector Field ATI 


Definition Divergence of the vector field Е 


. m! OF, 0Ё, OF 
div F = Jim p Fda = t + So 4 (17.9) 
The divergence of a vector field is a scalar. In other words: the operation of 
computing the divergence maps a vector field F onto a scalar field div F. 
Please recall Chap. 12.4 on total differentials, where we have already encoun- 
tered the operator just mentioned. We will now give it the name nabla (sometimes 
also called del) and a symbol V of its own: 


9 9 9 
v= (am) 


This identity, which is valid in the Cartesian coordinate system, enables us to, 
formally, express the divergence of a vector field as the dot product of the nabla 
operator and the vector field: 
дЕ, | OF, дЕ. 

Ox ду 92 

Let us return to our example from electrostatics where a given charge density 

p — dQ /dV gives rise to a field E. As was already mentioned, the laws of physics 


tell us 
{ева = 2 
£0 


Q is the total charge inside the volume that is enclosed by the surface A. 
After dividing by the volume V and going to the limit V — 0 we obtain 
р(х,у,2) 
£0 


divF =V-F= 


div E (x,y,z) = 


We thus arrive at an equation connecting the values of E and p for each point in 
space. 


Example The divergence of a homogeneous vector field vanishes because the 
derivative of a constant always amounts to zero. 


Е (х, у,2) = (a,b,c) 


Example The vector field F (x,y,z) = (x,y,z) is characterized by a constant di- 
vergence = 3. 
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Fig. 17.26 


Example The electric field Е generated by a spherical charge distribution with 
total charge Q and radius R is given by the following expression for points away 
from the sphere: 


Е (х,у,2) = Q (X92) 
Ame (A/ x2 + y2 + z2)3 
For points inside the sphere it is given by 
E (x, y,z) 


= Meer eoa) 


Outside of the sphere the divergence of the electric field vanishes: 


2 d 92:31:72 
dvE- 016 хашин, 
4780 (\/x2 + y2 + 22)3 (x2 + y24z2)5 


Inside the sphere we get 


Q 


div E = —— = — 
4reoR? ео 


Assuming a homogeneous charge distribution inside the sphere, each point is 
a source of the electric field. Away from the sphere the field possesses neither 
sources nor sinks. 


17.6 Gauss’s Theorem 


This theorem establishes a connection between the integral of the divergence of any 
vector field F over any volume in space V and the total flow of that vector field 
through the bounding surface A. 

As usual, let us dissect the volume V into и small parts AV; having boundary 
ЛА;. For each such element we can approximate the value of div F as follows 


1 
div F (x;,;,2;) © AV. ў ғ -аа 
1 
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Multiplication by AV; and adding the n values yields 


n n 
У div F (xi, yi, zi)AVi 25 bj fF -dA 
i=1 i=1 


It is quite apparent that all adjacent faces of the AV; in the interior give rise to 
values which cancel out. So just the values of F -dA on the surface A are left over 
to be summed up. In the limit n — о and V; — 0 we arrive at 


n 
lim Y div F (xi yiz)AV; = | div F -av 
нээрт 4 


and furthermore И 
lim F-dA— f F-dA 
П оо 2 А; Д 


Fig. 17.27 


All this adds up to the following theorem. 


Gauss’s theorem 
1 div F -aV = ў F -dA (17.10) 
y A(V) 


Gauss's theorem is sometimes also referred to as Ostrogradski's theorem to honor 
the Russian mathematician Michel Ostrogradski (1801-1861) who independently 
worked on this topic. 

The theorem tells us that the integral of the divergence of a vector field over 
a volume in space equals the total flow of the vector field through the bounding 
surface. 
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17.7 Curl of a Vector Field 


Some vector fields are especially well behaved in the following sense: given any 
two points Pı and Р», the value of the line integral from the first point to the other 
does not depend on the particular path connecting them. 

One well-known example is provided by the gravitational field, another one is 
the electric field generated by static point charges. In cases like these, we are free to 
choose the path in a most convenient way in order to facilitate the integration. 

These vector fields are characterized by the following result: 


Observation The value of the line integral from a point Pı to another 
point P5 does not depend on the chosen path if and only if 
any closed line integral along any closed curve C vanishes: 


f F-ds=0 
C 


For the easy proof we only have to realize that an arbitrary closed path containing 
both Pı and Р» can be split into two paths: the one connects Pı to P2, which in 
Fig. 17.28 is denoted by С, and the other one, Со, connects Р» to P1: 


P5 Pi 
jr] Fdss[ F.ds—0 P, 
Ci C5 
Р, 
Equivalently: C; 
Р, P1 
F -ds = — F -ds Fig. 17.28 
Pi Po 
Ci C2 


Inverting the direction of the path changes the sign of the integral and we are 
done. 


Р, 

F -ds = F -ds 
Р Р, 
Ci C5 


A vector field is called curl-free or irrotational if all line integrals along any 
closed curve amount to zero. 

This definition would not make sense, if all vector fields had that property. In- 
deed, there are many such vector fields where generally 


$, F-ds#0 
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Conversely, a vector field for which the line integrals do not generally vanish is 
said to possess curl. 


Example Suppose a magnetic field B perpendic- 2 
ular to the plane changes in time and thus gen- 

erates an electric field as shown in Fig. 17.29. 

The work required to move a positive charge from 

point Pı to P2 is certainly dependent on the path 

chosen: along Сү it will be negative, whilst along 

C» it will be positive. 

So the line integral from P, to P5 and back to Pi 

again is the difference between the two values and P 


will not be zero. 
Fig. 17.29 


The value of the line integral along a closed curve is called circulation. 

Figure 17.30 depicts three vector fields and a circular path in each of them. The 
corresponding circulation is largest in case 1), and it is zero in case 3). 

It is very useful to be able to talk about the "circulation" at a single point. So 
quite similar to determining the divergence in Sect. 17.5, we set out to compute the 
line integrals for ever smaller contours, all of which converge to a common point P. 

However, we must observe one additional fine point: as we are dealing with sur- 
faces, we must take their orientation into account. This is given by the surface ele- 
ment vector A. 

Now consider the limit: 


1 
lim — $ F -ds 
4-0 A JC(A) 
We are free to orient A in different directions and will normally get different 
results. In fact, the vector we are keen to know is obtained as follows: use the three 


Fig. 17.30 
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standard planes to compute three components and combine them to form a vector 
accordingly. 

The vector thus obtained is called the curl of Ё, sometimes also denoted by 
rot F in accordance with the notation of the famous Scottish physicist James Clerk 
Maxwell (1831—1879). 

For a physical understanding of the concept of curl think of water flowing along 
a brook. As opposed to the tame example in the introductory section of this chapter, 
now the vector field v will describe both a general flow along the main direction and 
locally also give rise to circular movements. Imagine a small spherical body having 
the same density as the water immersed in the fluid. At points with nonzero curl 
the sphere will spin. Its rotational axis points into the direction of curl v, its angular 
velocity indicates the magnitude. 


The rigorous but tedious proof of this 
statement will not be given here. The 
keen reader is advised to check special- 
ized mathematics books or consult the Az | Ay-Az 
Internet. М à 
The next task is to arrive at a concrete 
rule for calculating the components of - 
the vector curl F. Let us start with the 

x-component and choose a small rect- 

angular plane surface Ах with dimen- 

sions Ay and Az. Fig. 17.31 


«Y 


A first approximation for the x- 
component of the line integral can be 
obtained by multiplying each length of = Fyayz+ az) 
the rectangle by the projection of F 
onto the path of integration as shown 
in Fig. 17.32: 


Ey + Ay,z) 


> 


Fixyz) 
эс 


> 
< 


Az 


> 


х 


-F YQY,2Z) 


БА 


Fig. 17.32 
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: $, ras 
AyAz JC, 
1 
Ke equo TAA AE a Be ty ААУ 
T Fy(x, y, z)Ay] 
I Fz(x,y +Ay,z) — Fz(x,y,z) Fy(x, y,z c Az) — Fy(x,y,Z) 
-| Ay Az 


curl, F = 


In the limit Ay — 0,Az — 0 we obtain the difference of partial derivatives 


The other components of curl F can be determined similarly. So we can an- 
nounce the following: 


Definition Curl of a vector field 


OF, OFy OF, OF; дЕ 23 


ду Oz ' Oz Ox’ Ox ду 
(17.11) 


curl F = ( 


Using the nabla operator V in Cartesian coordinates the curl of a vector field F 
can be expressed as the cross product of V and F: 


culF -Ух Е 


Like any cross product the curl can also be expressed as a determinant: 


ех €y ez 
LU CA 
Ox ду Oz 
Е; Fy Е, 


curl F = 


Computing the curl assigns a new vector field to the given vector field. Recall, 
that computing the divergence maps a vector field into a scalar field. 


Example Let us compute the curl of a spherical symmetric field: 


F (x,y,z) = (x,y,z) 
curl F = (0,0,0) 


This vector field obviously is free of curl. (The reader is invited to generalize this 
result to any spherical symmetric field.) 
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Example Figure 17.33 shows a cross sec- 
tion of a fluid streaming in the y-direction. 
The velocity is zero at the bottom (z = 0) 
and it increases linearly proportional to the 
height. 

The velocity field can be expressed as fol- 
lows 


0(х,у,2) =а2:еу; а = const 


The curl of v is (—a,0,0), and so the line 
integral along a path C as indicated will 
not vanish. 


Example Compute the curl of the vector 
field 


Е (x,y,z) = (—y,x,0) 
curl F = (0,0,2) 


This vector field is not curl-free which 
becomes apparent when looking at 
Fig. 17.34. 


17.8 Stokes’ Theorem 


This theorem establishes a connection be- 
tween the integral of the curl of any vector 
field F over any surface in space A and the 
line integral of that vector field along the 
boundary C. 

Figure 17.35 shows how the surface can be 
approximated by small elements dA ;, each 
of which is bounded by C;. 


zh 
D 
т Ln 
Lem 
Ld т 
1 
1 1 
1—5 — 
Y 
m — 
-- 
С 
Ld 
x y 
Fig. 17.33 


Fig. 17.34 


АА, C 
AA. 


Ха» 


Fig. 17.35 


We next compute the line integral and observe that it approximates the product 


of curl F times the small area: w 


1 727 
Cj 
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Summing for all 7 leads to 


Your. -AA; = Ў omi F Xi, Yi, Zi) AA; 54 Хү F -ds 
1=1 i=1 


bys С 
Ву an argument similar to the one used be- 


fore we realize that line integrals along in- 

ner paths cancel in pairs so that only the Á 
paths at the outer boundary are relevant. 

The limiting process, as AA; — 0,n — о 

then leads to Stokes' theorem. 


Fig. 17.36 


Stokes" Theorem 


3 curl F -dA -4 F -ds (17.12) 
A C(A) 


Its name honors the Irish mathematician and physicist Sir George Gabriel Stokes 
(1819-1903). 

Stokes’ theorem tells us that the integral of the curl of a vector field on a surface 
A equals the line integral along the boundary C. 

Suppose we know for a given vector field that curl F = 0 in a certain volume 
V which contains the area A. Then the left hand side in Stokes’ theorem vanishes 
altogether and we are left with 


j 22777 
С(А) 


Recalling Sect. 17.7 we can then be certain that the line integrals in that particular 
volume are independent of the path. 


17.9 Potential of a Vector Field 


Assume that a given vector field F (х, у,2) is curl-free. According to the observation 
from Sect. 17.7 the line integral between two points Ро and P does not depend on 
the chosen path. 

Consider now P to be a variable and assign the value of the line integral to the 
point. This is just an ordinary function of P. Let us call that function the vector 
potential (Р) of the vector field F. 


P 
g(P) = Е -ds (17.13) 
Po 
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Obviously, this procedure will work for any curl-free vector field. The potential 
ф is uniquely defined up to an additive constant, which is determined by the choice 
of Po. 
Next, let us convince ourselves that from the definition 17.13 the following rela- 
tion can be arrived at: 
Е (x,y,z) = grado 


Recall, how in Sect. 12.4.3 we have derived а vector field from a given scalar 
function, named its gradient. The vector grad q is orthogonal to the surfaces which 
are defined by ф = const. and its magnitude indicates the change of р as we move 
from one surface to the adjacent one in an orthogonal direction. 


The change of р with respect to an infinitesimal change in space is given by: 
аф = grado ds 


In case of larger distances an integral must be used and we then obtain 


P 
Ф(Р)- | grado ds 
Po 


This happens to coincide with definition 17.13 for the potential of a vector field. 
So we conclude 


др дф 9 
Any curl-free vector field F can be assigned a potential field o according to the 


relationship: 
Р-(х,у,2) 


es») = |, F -ds 


Po 


If we know the scalar field ф(х, у,2) the corresponding vector field F (x, y,z) 
can be computed as its gradient. 


potential ф Vector F 


The meaning of this roundabout connection for physics is as follows: the field 
F can be interpreted as a force field and р as the potential energy. Furthermore, 
by convention, the potential can be defined as the value of the line integral along 
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any path connecting some arbitrary point Ро to the point P against the force field, 
which means that the sign is reversed. 

To sum up, in physics the relation between a curl-free force field F and its po- 
tential ф is defined as follows: 


Р 
paya =- | Fds 
Po 
F (x,y,z) = —grado 


As was already mentioned in Sect. 6.9, a force field without curl is also called 
a conservative field. 

As an example let us look at the gravitational field produced by a mass M which 
evenly fills a sphere of radius R. Outside of the sphere the force is given by 


(x, 952) 
Vy 
(G is, of course, Newton's constant of gravity, sometimes also denoted by y. Its 
approximate value is 6.67428 x 10—11 m? Ка” 142.) 

As the reader can verify for him or herself F is curl-free. The potential is deter- 
mined by 


F(x,y,z) = -GM 


л (x,y,z) d (4х,4у,42) 
ебу) = n | 3 
If we choose the path of integration in the radial direction, the dot product r edr 
simply is rdr, and the integral is easy to compute. 
The bounds are: 


ro = bx and r=Vx2+y2+422 


"d 1 1 1 1l 
ебу) = 6m | "= om ( )- eu (2-5) 
ro Pl r ro ro r 


The potential р is fixed up to an additive constant GM /rg. By convention the 
potential energy is set to 0 as r — оо , so ф will be 


—GM 


p(x, yz) = -= 
i ) Vx? +y? +z? 


Just to make sure, let us compute the gradient in order to arrive at F again: 
(x,y,z) 


3 
Vx? +y? +z? 


F = —grado = -GM 
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17.10 Short Reference on Vector Derivatives 


Notation 

Үр = grado ф is a scalar function, grad is a vector field. 

У.Е = div F F is a vector field, div F is a scalar function. 

V x F = curl F F is a vector field, curl F is a vector field. 

Ү (Ур) = div (grado) = Ар Ais called the Laplacian operator, sometimes writ- 


ten as V2. 


Important Identities 
V x (Уф) = curl (grado) = 0 


У.(Ух Е) = div(curl F) =0 


For any scalar function ф, the curl of grado 
vanishes. 
For any vector field F , the divergence of curl F 
vanishes. 


V x (V x ЕЁ) =curl(curl F) = V-(V- F) - AF = grad (div Е) — Laplacian( F ) 


Special case spherical symmetry: 


Vx Е =curl F =0 


v.F-dwvr- 2/0) 970) 
or 


Further Information on the web: 


This identity holds true in Cartesian coordi- 
nates. 
r 


Е = feu excluding r — 0 
r 

The curl of a spherically symmetrical field van- 

ishes. 


Note the exceptional case of inverse square 


С 


functions f (r) = —. 
r 


http://eom.springer.de and http://mathworld.wolfram.com 


Exercises 


1. Given three squares with an area of 4 units each. They are placed 


(a) in the x-y plane, 
(b) in the x-z plane, and 
(c) in the y-z plane. 


Determine the surface elements. 
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2. Given a rectangle with area a · b, determine the vector element. 


3. Compute the flow of the vector field F (x, y,z) = (5,3,0) through the surfaces 
given by the respective surface elements: 


(a) A = (1,1,1) 
(b) A = (2,0,0) 
(c) A = (0,3,1) 


4. Find the vector surface elements for the cuboid shown in the figure. 


74 


5. Compute the flow of the vector field F (x,y,z) = (2,2,4) through 
(a) a sphere centered at the origin with radius R = 3. 
(b) the cuboid from exercise 4 
6. Compute the flow of the vector fields through a sphere centered at the origin 
with radius R. 
(x,y,z) 
x? +y? +z? 
(x,y,z) 


(b) F (x,y,z) = 
шаг Vite yu 


(a) F(x,y,z) 23 
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7. A vector field is given by F (x,y,z) = (z, y,0). Compute the flow through 
area A. 


8. Compute div F for the given vector fields Ё. Indicate the respective sources and 
sinks, if applicable. 


(a) Е (x,y,z) = (x - a,y,z) 
(5) Fx, y,z) = (а,—х,2?) 

9. Do the following vector fields possess curl? 
(a) Е (x,y,z) = (ax, b) 


(x,y,z) 

b) F = ——————À 

10. Compute the value of the line integral 4 F · ds along the rectangular path in the 
x-y plane with dimensions a and b. 


The vector field is given by F (x,y,z) = 5(0,y,z) 


Z 


х y 
11. For the vector field F (x,y,z) = (0,y,z) compute the line integral along the 
path as shown in the figure, i.e. from the point (0,0,0) to the point (0,2,3) and 
then to the point (0,0,3). 


Chapter 18 
Fourier Series; Harmonic Analysis 


18.1 Expansion of a Periodic Function into a Fourier Series 


In Chap. 8 we showed that a function / (х) which may be differentiated any number 
of times can usually be expanded in an infinite series in powers of x, i.e. 


f(x) = Y, anx” 
n=0 


The advantage of the expansion is that each term can be differentiated and integrated 
easily and, in particular, it is useful in obtaining an approximate value of the function 
by taking the first few terms. 

We now ask whether a function can be expanded in terms of functions other than 
power functions, and especially whether a periodic function may be expanded in 
terms of periodic functions, say trigonometric functions. Many problems in physics 
and engineering involve periodic functions, particularly in electrical engineering, 
vibrations, sound and heat conduction. A periodic function f (x) is a function such 
that f(x) = f (x+ L), where L is the smallest value for which the relationship is 
satisfied. 

Fourier’s theorem relates to periodic functions and states that any periodic func- 
tion can be expressed as the sum of sine functions of different amplitudes, phases 
and periods. The periods are of the form L divided by a positive integer. 

Thus, however irregular the curve representing the function may be, as long as 
its ordinates repeat themselves after equal intervals, it is possible to resolve it into 
a number of sine curves, the ordinates of which when added together give the or- 
dinates of the original function. This resolution of a periodic curve is known as 
harmonic analysis. 

To simplify the mathematics we will start by considering functions whose period 
is 2л; this implies (see Fig. 18.1) that 


f(x) = f(x+2n) 
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Fig. 18.1 


Expressed mathematically, Fourier’s theorem states that 


y-f()- Y. Ansin(ax + bn) (18.1) 


n=0 


Since sin(nx + n) = sinnx cos¢y +cosnx sin фп, we can express the function in 
terms of sine and cosine functions. We have 


y=f(x)= S + У (an cosnx + by sinnx) (18.2) 
n=1 


This series is called a Fourier series. The terms in Fourier series differ in period (or 
frequency). The nth term has the period 27 /n (or the frequency n/(2z)). The 1/2 
in а0/2 is to make ao fit the general equation. 


18.1.1 Evaluation of the Coefficients 


Before actually proceeding with the evaluation of the coefficients, we will state 
the results of some definite integrals in the range -л to л, where n and m are 
positive integers. It can be shown that the same results are obtained in the range 
from 0 to 27. 


л л 
J cosnx dx = n: sinnx dx = 0 [1] 
-л -л 
л л 0 
! cosmx cosnx dx = | sinmx sinnx dx -| АК [2] 
-л -л л, MSN 
л 
| sinmx cosnx dx = 0 [3] 
x 


The integrals in Eq. [1] are standard. 
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Let us evaluate the first integral in Eq. [2]. We integrate by parts: 
л 1 л 
| cosmx cosnx dx = | —sinmx cosnx + =f sinmx sinnx dx 
-7 т -r m 
The first term is zero. The second term can be integrated by parts once more: 


2 


7 Р n ; л n Е 
“| sinmx sinnx dx = |- созтх sinnx| +22 |. COSMX СОЅИХ ах 
т т -T m 


Again the first term is zero. Inserting this result into the original integral and rear- 


ranging gives 
n? 2 
1-— 1 cosmx cosnx dx =0 
m2 — 


Thus the integral is zero except for n? /m? = 1, i.e. n = т. 
In the latter case we have a standard integral, the result of which is known: 


X 
1 cos? mx dx = л 
= 


The integral with sine functions may be solved by the reader. The solution can be 
obtained in exactly the same way. 

The integral in (18.3) may be solved in the same way too. In the case of m = n 
we have 


T 1 JU 
T sinmx cosmx dx — = sin2mx dx = 0 


= -T 


Evaluation of ag 


To find ao we integrate the Fourier series from —z to л: 


+T 89 +л +л 
К a= f ао dx + p: (oj cosnx dx-+Dn | sinnx ax) 


Е n=1 -л -л 


According to Eq. [1] above, all integrals in the infinite sum vanish. Hence 


Tz ] /7 
1 TOLSE] ад dx = Tao 
-7 2 J-x 


Therefore, we have obtained ao: 


о=1 уо) а 


Note that ao /2 is the average value of the function in the range —z to +z. 
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Evaluation of an 


The coefficients ал have to be evaluated one by one. For a given n = k multiply the 
Fourier series by cos kx and integrate from —z to +z: 


TX TX ад eo, л 
f (x)coskx dx = ! гр coskx ах +) | dncosnx coskx dx 
174-7 


-л 


ee. Tz 
+ »3 | by sinnx coskx dx 
n=1“ 27 


By virtue of equations 1 and 3 above, all integrals on the right vanish except the one 
for n = k. We thus obtain 


+л +T +л 
2 


аһ cosnx cosnx dx = an | соѕ пх dx = аһл 
л: 


f (x)cosnx dx = / 


-л 


1 л 
Hence аһ = = | f (x)cosnx dx 
= 


Evaluation of bn 


We proceed in the same way: we multiply the Fourier series by sin kx and integrate 
in the range — to л. All integrals on the right vanish except for n = k. 


i 2 
| by ѕіп пх 
л 


1 л 
Непсе bn = -J f (x)sinnx dx 
Л /-л 


І 


by 


The result is: 


If a function f (x) of period 27 can be represented in a Fourier series then 


T(x) = 25 х an cosnx + У by sinnx 


n=1 


where ao = Чи (18.3) 
an= 2 |" f(x)cosnx dx  n=1,2,... (184) 
Л /-л 


bn 


! 


1 л 
-J f (x)sinnx dx n=1,2,... (18.5) 
Л /-л 


Since f (х) is a periodic function of period 2л we could, if we wished, use 
the range 0 to 27 instead, or any other interval of length 27r. 
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The terms cos x, sin x are known as the fundamental or first harmonic, cos 2x, 
sin 2x as the second harmonic, cos 3x, sin 3x as the third harmonic and so on. 

We have not yet discussed the conditions that must be satisfied by f(x) for the 
expansion to be possible. There are, in fact, several sufficient conditions which guar- 
antee that the Fourier expansion is valid, and most functions the applied scientist is 
likely to meet in practice will be Fourier expandable. 

We should mention one criterion which is connected with the name of the emi- 
nent mathematician Peter С. L. Dirichlet (1805-1859). 

Dirichlet’s lemma states that a periodic function f(x) which is bounded (i.e. 
there is a constant B such that | f (x)| < B for all x) and which has a finite number 
of maxima and minima and a finite number of points of discontinuity in the interval 
[—L; L] has a convergent Fourier series. This series converges towards the value 
of the function f (x) at all points where it is continuous. At points of discontinuity 
the value of the Fourier series is equal to the arithmetical mean of the left-hand and 
right-hand limit of the function f (х), i.e. it is equal to 


1 
=[ lim f(x+Ax)+ lim f(x —Ax)} 
2 Ax —0 Ax —0 

Ax 20 Ax 50 


The proof of this lemma is beyond the scope of this book, and the reader should 
refer to advanced books on mathematics. 


18.1.2 Odd and Even Functions 


Even Functions 


A function is even when f (х) = f(—x). In this case all the coefficients b, vanish. 
Since f (x) sin nx is an odd function, its integral from —z to л is zero. 


For an even function the Fourier series is 


ao 2 
Xx)——- An COS" X 
f= 9+ У, а 


Odd Functions 


A function is odd when f(x) = — f (—x). In this case all the coefficients ал vanish. 
Since f(x) cos nx is an odd function, its integral from —z to л is zero. 


For an odd function the Fourier series is 


f(x) = Y by sinnx 
n=1 
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Thus the Fourier series for an even function consists of cosine terms only, 
whereas that for an odd function consists of sine terms only. 
18.2 Examples of Fourier Series 


Sawtooth Waveform 


The sawtooth function is shown in Fig. 18.2 with a period of 27. It is defined by 


Fig. 18.2 


Since the function is odd, only the coefficients by, are required. Because there 
are two branches of the function we have to split the interval of integration. We 
then have 


b, = ЯВ: = +1) ) sinnx dx + — tah (=-1) sinnx dx 
1 л 
5 NE ас е ax—— | sinnx dx 
л ёс л JO 


The first integral can be solved by parts; the other two are standard. 
Integrating gives 


л 1 л 1 0 
bn = – ЁЛ2 + E чин ын EL 
= z?n n E 
—— 
-0 = 
л 
+ EL 

лп 0 

2 
bn IE 
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6 terms 


Fig. 18.3 


Hence, for the sawtooth waveform the Fourier series is 


х sinnx 


2 
Tu on 


fœ) =- 


Figure 18.3 shows the first six terms of the expansion in the range — r < x <0. As 
the number of terms is increased the series gets closer and closer to the function. 


Triangular Waveform 


The triangular function is shown in Fig. 18.4. Its period is 2л. It is defined by 


Fig. 18.4 
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Half sine wave, such as a rectified alternating current. 
Since it is an even function, we only need to calculate the coefficients ад. Because 
there are two branches of the function we have to split the interval of integration. 


а= | i x) dx+— FN хах = л 


аһ = T M )cosnx dx + — і xcosnx dx 


Integrating by parts gives 


0 0 m л 
1 x? s 1 Men: 
an = — 5 COSNX — | — sinnx + 5 COSNX + | —sinnx 
wn -r Tn zn 


2 ( 1) 
= —7(COSNT — 
an ae 


If n is even cos nz = +1 hence an = 0 
If n is odd cos nz = —1 hence аһ = ———m 


The Fourier series of the triangular waveform is 


X Z cos(2n + 1)x 
f073 2:2 Qn 41? 


Rectangular Waveform 


The function is shown in Fig. 18.5. It is defined in the interval —z to л by 


edu ep E a 
Р(х) = 1, -$«x&€5 
mp Wb 

f(x) 


Fig. 18.5 
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Since the function is even, we need only calculate the coefficients ал. 


do = =| f(x)dx = (we can see this by inspection) 
z/2 л/2 л 
аһ = — IS cosnx мө? cosnx ax- f cosnx ax) 
л -n л/2 
2 . л 
an = — sin — 
” zn 2 


Figure 18.6 shows approximations to the function. We see that as we take more and 
more terms we approach the original function f(x) = fi + fa + 33 +: 

There is, however, a snag to approximating a function at a point of discontinuity 
such as x = zx /2 in our example. Even if a lot of terms are added, fı + f2 + f3 + 

+ fpg, the graph overshoots before and after the discontinuity. 

This is known as the Gibbs phenomenon. For most practical purposes this effect 
can usually be neglected; but if special attention is to be paid to the function at 
a discontinuity, the Gibbs phenomenon must be kept in mind. 


Athths 


Fig. 18.6 
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Half sine wave, such as a rectified alternating current. 
The function is shown in Fig. 18.7. It is defined by 


-л<х<0 
Iosinx , О<х<л 


І= 1р sin wt = h sin х 


Fig. 18.7 


The coefficient ао is 


I 0 T I 
a= (/ оах+ | Чихах)-22 
л -7 0 VIA 


To find the coefficients a, and b, we use two identities which are based on the 
addition formulae (Chap. 3, Sect. 3.6.6): 


sin (n + 1)x = sin nx cos x + sin x cos nx 


sin (n — 1)х = sin nx cos x — sin x cos nx 
Subtracting gives 
sin (n + 1)x —sin(n — 1)х = 2 sin x cos nx 
Similarly, we obtain 


cos (n + 1)x = cos nx cos x — sin nx sin x 


cos (n — 1)х = cos nx cos x + sin nx sin x 
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Subtracting gives 
cos (n — 1)x —cos(n+ 1)x = 2sin nx sin x 


Now we evaluate an: 


Io (7 Io (71 
= | sinx cosnxdx = 2 | = (sin(n + 1)x — sin(n — 1)x) dx 
л Jo л Jo 2 


üg — 
Io | cos(n+1)x соѕ(и — D)x |^ 
= (n #1) 
2л п+1 п 1 0 
Io (7. 
If n=1, а= | sin x cos хах = 0; hence a; = 0 
л ЈО 
If n is odd n + 1 апал — 1 are even, so that 
dn =0 (n odd) 
If n is even n + 1 апал — 1 are odd, so that 
21 
аһ = 9 (n even) 


л(п+1)(п– 1) 


The coefficients bn are 


I л 
b, = 21 sinx sinnx dx 
л Jo 
lo (71 : 
= | cos(n — 1)x — cos(n + 1)x | dx = 0 ifnzl 
л Jo 2 
Io (Л I 
If n=1, в-2| sin? х dx = 2 
x Jo 2 
The Fourier series for the rectified waveform is 
I 2 2 2 
Р(х) = - (1+ 5 sin x ET 3 C08 2x a = Cos 4x zx 7 C08 Ox =) 


18.3 Expansion of Functions of Period 2L 


A periodic function f (x) of period 2L is repeated when x increases by 2L, i.e. 


f(x+2L) = f(x) 


If we put z = x /L x, then the new function f(z) is a periodic function of period 
2л. As x increases from — L to L, z increases from — to л, and (18.2) holds true 
for the new variable z. We have 


fz)- T + У (an cosnz + b, sinnz) (18.6) 


n=1 
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To get back to the original function, we simply replace z by x / L x and obtain 


Т(х)- шир У (an cos шинээ sin Tx) (18.7) 
2 4 L L 


The coefficients of a Fourier series of a function with period 2L are 


1 L 
a - 7 | f (x) dx (18.8) 
L J-L 
1 sb пл 
an = т | feos x dx n= 1,2... (18.9) 
bn = Эр f(x)sin =x dx n=1,2 (18.10) 
gs x L* Sl Des. : 


Rectangular Waveform of Period 4 


The function is defined by 


ын 2<x<0 


1, 0<х<2 


1 2 

From (18.8): ао = = dx = 
2 Јо 
1 2 

From (18.9): аһ = = cos ae dx =0 
2 Jo 2 
1 2 1 

From (18.10): bn = sin Z> dx = (1— cos пл) 
2 Jo 2 пл 

We therefore have 
f(x i= 12 sin = + А ues 
ВР 2 2 2 


18.4 Fourier Spectrum 


A periodic function of time, such as a vibratory motion, expressed as a Fourier series 
is often represented by a Fourier spectrum. It consists of the values of the amplitudes 
and phases of the different terms as a function of the frequency. This is shown in 


18.4 Fourier Spectrum 503 


Fig. 18.8. a; is the amplitude of the fundamental frequency w; a2, аз etc. are the 
amplitudes of the harmonics 20), Зо, etc. and фі, фо, etc. are the phase angles when 
the function is expressed in the form of (18.1). For example, if s is a vibration or 
a signal in an electrical system then (18.1) takes on the following form: 


s= У ansin (not + dn) (w = frequency, t= time) 
n=1 
A periodic electrical signal introduced into a transducer, filter or amplifier will be 
modified; it will be damped and distorted unless special precautions are taken in the 
design of the equipment. By expressing the signal as a Fourier series it is easy to 
find out how each Fourier component is affected. 

For sine and cosine functions these modifications are often easy to find either 
empirically by measurement or theoretically by calculation. As a rule, modifications 
depend on the frequency of the function. 

Thus the modification of an arbitrary periodic signal can be obtained by express- 
ing the signal as a Fourier series, finding the modifications of the Fourier compo- 
nents and reconstructing the modified signal. 

In a transmission line, for example, it is most important to ensure that a signal 
is not distorted. This implies that the relative amplitudes of the terms making up 
the signal, as well as the phase angles of the harmonics, are faithfully reproduced. 
In other words, it is necessary to ensure that the time shift due to the transmission 
remains the same for all harmonics. 


Frequency 


Fig. 18.8 
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Exercises 


1. Obtain the Fourier series for the function (Fig. 18.9) defined by 


0, mq 
fox)-—4l, теля 
0, me XS 


Fig. 18.9 


2. Obtain the Fourier series for the function (Fig. 18.10) defined below. The func- 
tion is periodic with period 27. 


Fig. 18.10 
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3. Obtain the Fourier series for a rectified waveform (Fig. 18.11) given by 


—sinx, —z«x«0 
sin х, О<х<л 


f(x) =|sinx| = 


Use the results obtained in the example on p. 498. 


Fig. 18.11 


4. Obtain the Fourier series for the function (Fig. 18.12) defined below. Its period 


is 47r. 
0, —-2n Ex«-—m 
х) = 1, -я«х«л 
0 л<х<2л 


Fig. 18.12 


Chapter 19 
Probability Calculus 


19.1 Introduction 


The concepts and methods of probability have been used increasingly in the last 
decades. They are the basis for an understanding of a large part of modern physics, 
both theoretical and applied, e.g. statistical and quantum mechanics. 

Statistical mechanics describes physical systems such as gases, solid bodies, and 
fluids, all of which consist of many atoms and molecules. The properties which refer 
to a single element of a system are called microscopic, e.g. state of motion, position, 
kinetic and potential energy. 

Those which refer to the whole system are called macroscopic, e.g. pressure, 
volume, temperature, magnetism, electrical conductivity and so on. 

Statistical mechanics attributes the macroscopic properties of the whole system 
to the microscopic properties of the constituent elements. Statistical mechanics uses, 
above all, the fact that the whole system consists of a very large number of elements, 
e.g. 1 litre of air contains about 102° gas molecules. 

Quantum mechanics describes physical objects like atoms, atomic nuclei etc. In 
quantum mechanics statistics plays a major role, since only probability statements 
can be made about the properties of these objects. 

New mathematical methods are needed for the quantitative treatment of physical 
systems with a large number of elements; and it has been found that the theory of 
probability provides the necessary methods for the mathematical treatment of such 
systems. One important fact is that the larger the number of elements of a system 
under consideration is, the more closely will the results of the theory of probability 
agree with experiments. 

A further field of application is the theory of errors. All physical measurements 
are in principle liable to errors. Before we can draw useful conclusions about an 
experiment, we must first estimate the errors. In Chap. 21 we will show how to 
infer the accuracy of measurements from the scatter of points, i.e. the variance of 
experimental data. 

An important application is quality control in the small or large-scale industrial 
production of almost any product. Manufacturers need the sound knowledge of qual- 
ity control which statistics can provide. 
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19.2 Concept of Probability 


19.2.1 Random Experiment, Outcome Space and Events 


The concept of probability in mathematics has been derived from the colloquial 
word ‘probable’. If the sky is cloudy we say: ‘It will probably rain today’. Of course, 
we are not sure that it will rain, i.e. whether the event will occur or not, but there is 
a likely chance that it will. It is the aim of probability theory to calculate and express 
quantitatively the degree of certainty or uncertainty in the possible occurrence of 
an event. 

The concept of probability can be deduced from an analysis of well-known games 
of chance. Suppose a die is thrown. This can be repeated arbitrarily often. A process 
which can be repeated arbitrarily many times in accordance with certain rules is 
called an experiment. In this case the experiment can have six different outcomes. 
The possible outcomes are the number of spots on the faces of the die, i.e. the 
numbers 1, 2, 3, 4, 5, 6. The outcome cannot be forecast with certainty. Such an 
experiment is called a random experiment. 

The outcome of a random experiment depends on the statement of the problem. 
When throwing a die, as in our example, we could ask: ‘What is the number of 
spots?’ or ‘is the number even or odd?'. 

In the first case, the set of outcomes is 


{1, 2, 3, 4, 5, 6} 
In the second case the set of outcomes is 
{ even, odd } or {{2, 4, 6H 1, 3, 5H 


The set of outcomes is called the outcome space or the sample space of the experi- 
ment. 

Possible outcomes with equal probability are called elementary events. Equal 
probability means that no outcome is more favoured than any other for physical or 
logical reasons. 

In the case of throwing a die, we have six elementary events, the number of spots: 


R= {1, 2, 3, 4, 5, 6} 


Any set of elementary events is called an event. 

In the case of throwing a die and regarding the outcomes as ‘number even’ or 
‘number odd’ we have two events, each of which is a subset of the set of elementary 
events: 

{2, 4, 6}{1, 3, 5} 


Thus an event can be any outcome of a random experiment, depending on the 
problem: 


1. an elementary event; 
2. any combination of elementary events. 
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For the sake of completeness, an impossible event is defined as an event which is 
not an element of the outcome space. For example, when throwing a die, the event 
‘number of spots = 7’ is impossible. 


19.2.2 The Classical Definition of Probability 


Let a random experiment consist of N equally possible outcomes, i.e. elementary 
events. Let us consider an event A consisting of Na elementary events. The proba- 
bility P of outcome A is given by the classic definition of probability: 


Definition Classical definition of probability 


P(A) Na number of elementary events contained in event А 
O N total number of possible elementary events 


Some authors refer to the number № as ‘favoured’. 


Example What is the probability that an even number will appear when throwing 
a die? 

There are 6 possible ways a die can fall; hence N = 6. Of these 6 there are 3 
which will show an even number (2, 4, 6} and 3 an odd number (1, 3, 5}; therefore 
Na = 3. Consequently the probability that an even number will appear is 


1 
Р (even) = n Pes = 
N 6 2 
Example A pack of 52 cards contains 4 jacks. What is the probability of drawing 


one jack? 

It is possible to draw 52 different cards; thus N = 52 (possible elementary 
events). Since there are 4 jacks in the pack, we have 4 possibilities to realise the 
event ‘one jack’; hence Na = 4. Therefore the probability of drawing one jack is 


: МА 4 1 
Бонн Эг 


The basis of the classic definition of probability is the assumption that all elementary 
events are equally probable. In reality, all experiments carried out with real dice, 
playing cards and other games approach this assumption only to a certain degree. 
A perfect or fair die, for example, is one whose material is absolutely homogeneous 
and whose shape is perfect. 


19.2.3 The Statistical Definition of Probability 


In many cases it does not make sense in practice to assume that all elementary events 
are equally probably. For example, a die becomes deformed after long use, or the 
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centre of mass may be deliberately altered so that it is ‘loaded’ with the intent to 
defraud. 

In cases of this kind we can also state the probability that a particular event 
will occur. 

Let N experiments be carried out under identical conditions and let Na of them 
have the outcome A. The expression 


N 
ha = Лү is called the relative frequency . 


The relative frequency is a quantity which has to be determined experimentally. It 
should not be confused with the classic definition of probability. Na is the actual 
number of experiments with outcome A. 

If the values of the relative frequency do not change for a large number N of 
experiments actually carried out, then we can interpret the relative frequency as 
the probability. We call the probability obtained from the relative frequency the 
statistical probability P(A). 


Definition The statistical definition of probability is the limit of the relative 
frequency of an event A. 


P(A) cha = = 


as N — œ 


In practice, of course, it is not possible to carry out an arbitrarily large number of 
experiments. 


Example The probability of getting a 1 when throwing a die can be determined 
empirically, as shown in Fig. 19.1 which is the result of an actual test. The relative 
frequency approaches the value of 1/6 as the number of experiments increases. If 


Fig. 19.1 
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we repeat the test, the shape of the curve will be different at first but will again 
approach the value 1/6 for a large number of experiments. 

If the curve approaches a different value we can deduce that the die is faulty or 
loaded. In this way we can check the die. 

The concept of the statistical definition of probability refers to actual experi- 
ments. It can be applied to the result of scientific measurements (from observations 
of the radioactive decay process we can derive the probability of the decay of a ra- 
dium atom, for instance). In the case of throwing ideal dice or similar games, both 
definitions of probability coincide for large values of N. Unless we state otherwise, 
we will use the classic definition; this means that we will consider ideal experiments. 
In Chap. 21, which deals with the theory of errors, we shall concern ourselves with 
the analysis of the results of real experiments. 


19.2.4 General Properties of Probabilities 


We will now derive a few general properties of probabilities which will be useful in 
practical calculations by means of an example. 


Example А box contains № balls. N; balls have the number 1 written on them, N2 
the number 2,... and №; the number k. k is the largest number on a ball. 


In our case let N=9, k=4 
N,=2 № =1 N3=4 N4-2 
чиний ирээ! —Ó—— чини чий чилээ —_ 
(D (D © 0 000 0 0 
1 ji T ii 
2 balls 1 ball 4 balls 2 balls 
with the with the with the with the 
number 1 number 2 number 3 number 4 


We take one ball out of the box. Let this ball carry the number j. This event is 
denoted by j whose probability is 


Now we want to find what the probability is of our taking out a ball with the num- 

ber i onitoraball with the number /. This event is represented by the balls carrying 

the number i or j. There are №; and №; balls with the number i and j respectively. 
The probability of our taking out one ball in this subset is 


Ni +N; Ni " Nj 

м м N 
This is the addition law for probabilities and is only valid if the events are mutually 
exclusive or disjoint, i.e. independent. 


P(iorj)= 


= P(i)+P(j) 
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Rule Addition law for independent probabilities 
P(i or j) = P(i) - P(j) 
This theorem can be generalised for any number of mutually exclusive events. 


Rule : 
P(lor2or ... ork) = У P(i) 
i=1 


If the set of events is equal to the set of all possible events, then the probability 
P(10r2 or... or k) is a certainty. In such a case, the normalisation condition must 
be satisfied. 


Rule Normalisation condition for probabilities 
k 
S P(i)=1 


k 
Proof Let the total number of events be N, then Y №; = N 
j=1 


t= 


k k N; k 
and hence ЭР ен YN =—N= 


An event with the probability P = 1 occurs with certainty. 

What is the probability of taking out of the box a ball with the number m, where 
m > К? Since К is the largest number on a ball, we know that there are no balls with 
the number m. Therefore 


This means that an event which cannot take place has the probability zero: it is an 
impossible event. 


Example Each group of spots from 1 to 6 on a die has the probability 1/6 of ap- 
pearing when the die is thrown. 
The probability of throwing either a 1 ora 2 is 


1 1 1 
Р(10о:2)--4--525 
е 
The probability of throwing an even number is 
1 1 
P = — — — = — 
(even) + 6 + 2 
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The probability of obtaining any number between | and 6 is a certainty: 


1 1 1] 1 21 1 
Р(1 012013 ог40г5ог6) = < +<+е+е+е +1 


The probability of throwing а number greater than 6 is zero, an impossible event. 
We will consider one more relationship of this kind. Let the probability P for the 
occurrence of an event і be known and let N be the total number of possible events. 
To obtain the number N;, we need only solve the equation P(i) = N;/N for 
N;. Hence 


Nj = P(i)N 


Example When throwing a die, let N — 60. The probability of throwing a 1 is 
P(1) = 1/6. The number Nj for the event of ‘getting a 17 is 


1 
М = P(1)N = c x 60— 10 


This means that, on average, after 60 throws we can expect the number 1 to appear 
ten times. 


19.2.5 Probability of Statistically Independent Events. 
Compound Probability 


In this section we will determine the probability of the simultaneous occurrence of 
two different events. 

For example, a person throws a die and at the same time tosses a coin. We are 
interested in the probability of the die showing a 6 and the coin showing a tail. The 
experiments and their outcomes are independent of each other. 

Such composite events are called compound events. 

Suppose we have two groups of independent events, the number of spots on the 
die (group A) and the faces of the coin (group B). We require the probability of 
the simultaneous occurrence of these two events. The table lists all events and their 
possible combinations. 


Coin: 2 Die: 6 elementary events 

elementary events 1 2 3 4 5 6 
Head (H) HI H2 H3 H4 H5 H6 
Tail (T) T1 T2 T3 T4 T5 T6 


Compound events 


We see that there are 12 possible elementary compound events. 
The total number of elementary compound events is equal to the product of the 
number of elementary events of group A (the die) and the number of elementary 
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events of group B (the coin). Hence 
N=N,x № 


In the example above № = 6, № = 2, N = 12. 
From the table above we can determine the probability of a particular compound 
event when the die is thrown and the coin is tossed. 


Example What is the probability that the die will show a 6 and the coin a tail? 
In this case Ngr = 1 and N = 12. Hence P(6T) = 1/12. 


Example What is the probability that the die will show an even number and the 
coin a head? 

The number of events is N = 6 x 2 = 12. 

For group А (die), № = 3. 

For group B (coin), № = 1. 

Probability P (even, head) = 3x 1/12 = 1/4. 

The concept of statistically independent events is important. In our example oc- 
currence of the number 6 on the die is in no way influenced by the coin showing 
head or tail. The two events are independent. 


Definition If the probability of the occurrence of an event in group A is in 
no way influenced by the occurrence of an event in group B, 
then we say that the events Ац,..., An are 
statistically independent of events B1,..., Bm. 


The compound event ‘number even’ on the die and ‘head’ on the coin is made 
up of the events ‘number even’ and ‘head’. Hence the total number of compound 
events is given by the product of elementary events A and B. Thus 

N, number even, head — N, number even X М head 
The required probability of both events occurring is 


N, even X Ni head N, even Ni head 
P (even, head) — = x 
i №№ № № 


Rule The probability of the occurrence of a compound event — А and В – 
is for statistically independent events, A and B, the product of the 
probability of the occurrence of A and of B 


P(A and B) — P(A) x P(B) 


The probability of compound events is generally less than the probability of sin- 
gular events: 


P(A and B) « P(A) 
P(A and B) < P(B) 
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Example The probability that a thirty-year-old man will reach the age of 65 is, ас- 
cording to statistics, P (man) e 0.75. The probability that his wife (who is the same 
age) will live through these 30 years is P(woman) z 0.80. What is the probability 
that both will be living 30 years hence? 


P (man and woman) = 0.75 x 0.80 = 0.6 
Note that 


P (man and woman) 


P (man and woman) 


19.3 Permutations and Combinations 


To calculate probabilities the standard problem is to calculate the number of ele- 
mentary events contained in an event A. To simplify the solution of these problems 
we introduce the concept of permutations and combinations. 


19.3.1 Permutations 


Given n objects, we may place them in a row in any order; each such arrangement is 
called a permutation. Thus two elements, a and b can be arranged in two different 
ways: ab and ba. Three elements, a, b and c, can be arranged in six different ways: 


abc, bac, cab, acb, bca, cba 


How many permutations are there for n different elements a1, a5,...,a5? The fol- 
lowing reasoning leads us to the solution. First of all, we assume that the n possible 
places are empty. The element a, can be put in any place, i.e. there are n possibil- 
ities. The element a2 can be put in any one of the remaining (п — 1) places which 
are still empty, i.e. there are (п — 1) possibilities. Thus the number of possibilities 
for placing the first two elements is given by 


n(n — 1) 


For element аз there are (n — 2) possibilities, etc. until we reach element ал for 
which there is only one possibility left. 
Thus the total number of arrangements of the n elements a1, а2,..., ап 18 


п(п – 1)(п– 2)...1= п! 
Rule The number np of permutations of n different elements is 


np=1x2x3x...x(n—I)n=n! 
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Remember from Chap. 8, Sect. 8.2, that n! is referred to as ‘factorial п’. We also 
defined factorial zero as 0! = 1. 

For example, 3! = 1 x 2x 3 = 6 is read as ‘3 factorial equals 6’. 

It is a little more difficult to obtain the number of permutations of n elements if 
some of them are equal as, for example, in the case of the three elements a, b, b. 
We can write the number of permutations of these 3 elements thus: 


abb, bab, bba, abb, bab, bba 


But note that there are three pairs of identical permutations; hence there are only 
three different permutations. The identical permutations are formed by rearranging 
equal elements. If we are interested in the number of different permutations, then 
we must divide the total number of permutations by the number of permutations of 
equal elements. 


3! 6 
N. SS SSeS EL 3 
abb 2! 2 
Rule Ifoutof7 elements 711, n2, ..., nm elements are equal to each other, 


then the number of different permutations is 


n! 
цан п1!й21...Ит! 
1:12 т 

Proof Let np be the required number of permutations. From any of these, if the 
n,-like elements were different, we could make и! new permutations. Thus if the 
n,-like elements were all different, we would get трлі! permutations. 

Similarly, if the n2-like elements were different, we would get 72! new permuta- 
tions from each of the second set of permutations. 

Thus, if the 7,-like elements and the n2-like elements were all different, we 
would get npn !2?! permutations in all. 

The process is continued until all the sets of like elements are dealt with, and we 
then get the number of permutations of и elements which are all different. This is п! 

Therefore прл n2! ... ng! = n! 

n! 


Hence Пр = — — —— — 
ny!n2!...Nm! 


19.3.2 Combinations 


We now alter the problem considered in the previous section to ask the number of 
different ways in which k elements can be selected out of n elements. 


Example Consider the four letters a, b, c, d (n = 4). They can be arranged in groups 
of two (k — 2) as follows: 
ab, ac, ad, bc, bd, cd 


That is, there are six groups in all. 
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Example Now consider the five letters, a, b, c, d, e (n = 5). They can be arranged 
in groups of three (k — 3) as follows: 


abc, abd, abe, acd, ace, ade, bcd, bce, bde, cde 


That is, there are 10 groups in all. 


Definition Each of the groups which can be made by taking k different 
elements of a number of elements n is called a combination of 
these elements. 


Generally, if there are given n different elements and each group contains k of 
them, the symbol „Сұ or [P is used to denote the number of all possible groups. 

In other words, „С is the number of combinations of n different elements taken 
k at a time without repetition. This is also referred to as the kth class. The number 
obtained is given here without proof: 


! 
Binomial coefficient: (AS = a Sewer 
(t) nC k!(n — k)! 


Note: This is called the binomial coefficient. These coefficients occur in the bi- 


: : n ш n n—k pk 
nomial expansion (a +b)” = У к) р“. 
k=0 
n n n! 
i = = — ХІ i ! = 
Note the special case (0) (^) 0111 l since 0!= 1. (19.1) 


Example A club has 20 members. The managing committee is formed by 4 mem- 
bers having equal rights. How many combinations are possible in choosing a com- 
mittee? 


We must choose 4 members out of the 20. Thus there are (2 combinations: 


zæ 5000 


20V 201  20x19x18x17 
4] A4MHd6! 4х3х2х 1 


So far we have considered combinations containing different elements only. Now 
we will consider combinations where several elements are allowed to recur. We will 
see that, in this case, more combinations are possible. 


Example Given the four letters a, b, c, d, how many combinations are there when 
taking 2 letters at a time? 


(1) Without repetition: ab, ac, ad, bc, bd, cd 


These are 6 combinations 6 — : : 
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(11) With repetition: there are 6 combinations as in (i) plus aa, bb, cc, dd, giving 10 


combinations, i.e. 4 more than when repetition is not allowed. 


We give the rule without proof. 


Rule There are Ч E B ) combinations of n different elements taken 


k at a time with repetition. 


Up to now we have not made any allowance for the order in determining the 


number of combinations. If we distinguish between combinations with the same 
elements but with different orders, we are considering variations. We give the rule 
for the number of variations without proof. 


n! € : 
Rule There are ————— variations of n different elements taken k at 


(n—k)! 


a time with repetitions and taking account of different order. 


Exercises 


1. 


2: 


A menu contains five dishes from which two can be chosen freely. Give the 
sample space. 

A set of cards consists of 16 red and 16 black cards. What is the probability of 
drawing a black card out of the pile? 


. What is the probability that when tossing a die the number that appears will be 


divisible by 3? 


. An experiment is carried out 210 times. Outcome A has been measured 7 times. 


What is the relative frequency of outcome A? 


. A box contains 20 balls. Of these, 16 are blue and 4 are green. Evaluate the 


probability of taking out a blue ball and, after putting it back, the probability of 
drawing a green ball. 


. What is the probability that when tossing two dice we will obtain a sum of 2 


spots after the first throw and a sum of 5 after the second throw? 


. A player casts with two dice. What is the probability that a 2, a 3 or a 4 will be 


thrown? 


. A Sales representative has to visit 6 towns. In how many ways can the route be 


fixed so that the journey always starts from town A? 


. A teacher has a class of 15 pupils. Of these, 3 must be selected for a special task. 


How many possibilities are there? 


Chapter 20 
Probability Distributions 


20.1 Discrete and Continuous Probability Distributions 


20.1.1 Discrete Probability Distributions 


In the practical treatment of statistical problems, it is wise to characterise individ- 
ual outcomes of a random experiment with numerical values. A simple method is 
to number the individual outcomes consecutively. For example, the outcomes of 
throwing a die can be described by the number of the spots on the faces. 

The set of numerical values can be regarded as the range of definition of a vari- 
able, which is called a random variable or a variate. The outcomes of a random 
experiment are therefore assigned to the values of the random variable within its 
range of definition. 

If the range of definition consists of discrete values, we call the variable a discrete 
random variable. We are then in a position to assign to each value of the random 
variable the probability of its outcome in a random experiment. 

This can be expressed symbolically as 


Outcome of random experiment j — random variable x ; — probability P() 


The complete set of probabilities for each value of the discrete random variable in 
a random experiment is called a discrete probability distribution. 


Definition A discrete probability distribution is the complete set of proba- 
bilities of the discrete values of the random variable in a random 
experiment. 


Example Consider an ideal or 'fair' die. There are six possible outcomes when 
throwing a die. The discrete random variable, in this case, ‘number of spots’, as- 
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Random Probability 
variable x Р(х) P) 
1 Р) 2 
2 Р(2) = 1 
3 Р(3) = 1 
Ld 1 
: PO) 2 пере 
5 Р (5) = 1 ! ' П 1 ! І 
E ' ! ! i I \ 
6 P(6)=Ł ' 1 1 1 ' t 
0 1 2 3 4 5 6 x 
Fig. 20.1 


sumes the values from 1 to 6. Each value of the random variable has the prob- 
ability 
1 
Р(х) = = (x = 1, 2, 3, 4, 5, 6) 


The probability distribution for this somewhat trivial example is shown in Fig. 20.1. 
In the figure, the values of the discrete random variable are plotted on the abscissa 
and their respective probabilities along the ordinate. 


Example Two dice are thrown. We choose as the random variable the sum of the 
number of spots. Thus the random variable assumes the values 2, 3, 4,..., 12. We 
now seek the probability distribution for this random variable. 
The outcome x = 2, for example, can only be realised if each die shows a ‘1’. 
The probability for this event is 


на) 


The outcome х = 5, as another example, can be realised by 4 elementary events: 


First die Second die Sum 
1 4 5 
4 1 5 
2 3 5 
3 2 5 


The probability distribution in this example is shown as Fig. 20.2. 

We could have numbered the values of the random variable in a different way. 
The outcomes could have been assigned, just as easily, to the numbers 1 to 11 or 
to any other set of numbers consisting of 11 values, e.g. 100 to 110, but this would 
surely have made it more difficult to relate the values of the random variable to the 
numerical results of the experiment. 
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Random Probability Р(х) 
variable х Р(х) 


1 
2 36 
2 
3 36 
3 
4 36 
4 
5 36 : 
5 
6 36 6 ! 
7 e ! 
$ t?! ? 
8 2 po! 0d 
36 шиг 
9 4 T 3: 4.0, ft 
36 азе 
10 36 ыг. 
11 2 егт! эээ. 
36 won cb UO OR dodo 4 
215 аа 
d: 36 12220687. 
аг. Q0)! 
0 2 3 4 5 6 7 8 9 1011 12 x 


Fig. 20.2 


Example Consider a cylinder, having a base of 1 square unit, containing air and 
subdivided into 5 regions, as shown in Fig. 20.3. 

There are no material partitions between the regions. Let us consider an arbitrary 
air molecule. The purpose of the experiment is to measure the position (height) of 
this air molecule at a particular instant when the air has been thoroughly mixed. We 
wish to evaluate the probability that we will find the air molecule in a particular 
region. We number the regions 1 to 5; these are our random variables. The volume 


Regions 
Sey 5 
р 4 
3 
2 
H 
І 
Y 


Fig. 20.3 
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Random ^ Volume of Probability рүү 
variable x region Vy Р(х) 


бал 


AkwWwNnN eS 
— — m оо 
4 o= BIE NI 
ЭЭ 


н Ф 


1 
1 
1 
1 
2 


Fig. 20.4 


of each region is given in Fig. 20.4. The probability P(x) of the air molecule being 
at a particular position is given by the ratio of the partial volume Vy to the total 
volume V of the cylinder. Hence 


The table and the diagram show this discrete probability distribution. In this ex- 
ample, we assumed that the positional probability in a region was proportional to 
the volume of the region. 


20.1.2 Continuous Probability Distributions 


The random experiments considered so far had discrete values of the random vari- 
able. There are, however, many random experiments whose results are best ex- 
pressed by means of a continuous variable. 

Let us consider again the example of the cylinder filled with air and an arbitrarily 
chosen air molecule. 

The purpose of the experiment is to measure the position of this air molecule at 
a certain instant when the air is thoroughly mixed, and we now wish to evaluate the 
probability that we will find this air molecule at a position defined by the height h 
(Fig. 20.5). The external conditions of this random experiment remain unchanged; 
all that has changed is the formulation of the problem. The result is the position h 
of the air molecule which can take on any value between 0 and Н. 

The position (reading the height) h is now a continuous quantity; it is a logical 
way of describing the outcome of the random experiment with h as the random 
variable. We have thus obtained a new type of random experiment. The outcomes 
and the random variable have a continuous set of values. This is always the case 
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Fig. 20.5 


when we consider a measurement as a random experiment in which the measured 
quantity varies continuously. 

An important and perhaps a surprising conclusion for continuous random vari- 
ables can be drawn from our example. The probability of finding the air molecule 
somewhere between 0 and H is 1 (by the normalising condition). The number of 
possible readings of its position is infinite for an arbitrarily small subdivision of the 
height H. It follows, therefore, that the probability at a definite position Ло has to 
approach the value zero. 


For continuous random variables it is not possible to specify a probability 
different from zero for an exactly defined value of a random variable. 


A value for the probability of the outcome of a random experiment for a contin- 
uous variable can, however, be given if we formulate the problem differently, i.e. by 
asking for the probability that the air molecule will be inside a given interval. 

What then is the probability of finding the air molecule in the interval ho to 
ho + Ah? (see Fig. 20.6). 

The total height of the cylinder is H. The probability P (ho < h < hg + Ah) of 
finding the air molecule in the interval ho and ло + AA is proportional to the interval 
Ah, i.e. 


1 
P(ho € h < ho + Ah) = g^ 
In general, the probability will also depend on the value of Ло considered. This will 


be the case when, for example, we take the influence of gravity into account. 
Hence, more generally, we have 


P (ho “Л € ho + Ah) = F (ho)Ah 
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Fig. 20.6 


F (ho) is called the probability density function and is as yet unknown. If the prob- 
ability density is known as a function of h, then we can evaluate the probability of 
finding the air molecule in the interval Ah for each value Ло. 

This concept of a probability density function has to be very clearly distinguished 
from the probability of a discrete distribution. The probability density is not quite 
a probability, but a probability per unit of the random variable. The probability itself 
can only be given for an interval of the random variable. The probability for an 
interval is then the product of the probability density function by the magnitude of 
the interval (for small intervals). 

Let us now determine the probability distribution in the case of the air molecule. 


Case 1: We will assume for the time being that the influence of gravity can be 
neglected, in which case the probability density function will not depend on ho. The 
probability of finding the air molecule in an interval of length Ah was 
Ah 
H^ F(h)Ah 
Hence the probability density function is F (A) = 1/ Н; it is independent of л. 
Case 2: We now take the influence of gravity into account. The density of the air 
decreases with height in accordance with the barometric equation (see Chap. 13, 
Sect. 13.2). 
It is shown in statistical mechanics that the probability density function for a gas 
molecule, taking gravity into account is given by 


1 
F(ih)=—-e%" fo О<А<Н 
C 
The constant c depends on the conditions of the gas inside the column: 
2209 
ро = density at h = 0, Po = pressure at h = 0, and g = acceleration due to gravity. 
c is a constant which is obtained by the normalisation condition, i.e. 


a 


A 
| F(h) dh=1 
0 
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Н 1 h 
and hence | -e?"gdh-l 
o c 
1 
from which we get с--(1-в 98) 
q 


In this case, the probability density function is a function of the altitude A. 

Let us denote the probability density function for an arbitrary random variable x 
by f(x). In order to determine the probability that x will assume any value between 
x1 and x2, we have to add up the probabilities of all the intervals between x; and 
Хэ. When the lengths of the intervals Ax tend to zero the sum becomes an integral 


and we have 
x2 


Р(х) = | f (x) dx 


ХІ 


All probability distributions have to satisfy the normalisation condition; for discrete 
probability distributions this is 


For a continuous distribution, the normalisation condition is 


ESTE 


20.2 Mean Values of Discrete and Continuous Variables 
Mean Value of Discrete Random Variables 


A class of students is given a test with the following results: 


Grade Number of students 


4 


холоо мыо н 
оноо + 


It is often very useful to characterise the performance of а class Бу means of 
a single number, although by doing so a lot of information is lost, since the individ- 
ual grades of the pupils are not included. Nevertheless, such a number, if carefully 
chosen, is sufficient in many cases. Such a number is the arithmetical mean value. 
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The arithmetical mean value is defined as the sum of all individual grades divided 
by the number of students. In our example, we have 


(1х4+2х4+3х6+4х2+5х1+6х0) 43 2053 
17 Е 
We encounter the same type of problem when we measure a physical quantity sev- 
eral times in succession and the measured values do not agree completely with each 
other, which is frequently the case in practice. In this case we also take the arith- 
metical mean value as the most probable value of the physical quantity. 
Now let us generalise. If we carry out a random experiment n times, the random 
variable may take on the values Хү, Хо, X3, ..., Хи. 


The arithmetical mean value x of discrete random variables is defined as 
1 n 
z— L Y х; 
nj 


Let the range of definition of a discrete random variable x be Хү, Хо, ..., xy. We 
then carry out a random experiment n times. If the random value x; appears with 
a frequency n;, then the mean value x of the random variable is defined as 

2014 
x= nj Xi 
nj 
If we know the probability distribution P (i) of a discrete random variable, then the 
mean value is defined as 


Mean Value of Continuous Random Variables 


If a continuous random variable is defined between the values xı and x2 and 
the probability density function is f (x), then the mean value of x is given by 
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0 5000 10000 150000 Н(т) 
Fig. 20.8 


The mean value of the random variable h (the height of the gas molecule in the 
cylinder shown in Fig. 20.7) is 


H рет 1 He @ 
(1—e-«H) а 1—e-0H 


In the case of an air molecule in the Earth's atmosphere, œ has the value 0.00018 m^ !. 
Figure 20.8 shows the mean height / of a gas molecule plotted against the alti- 
tude H. 

We notice that for small values of H , h = Н /2. This means that the mean value 
of the random variable A is in the middle of the height of the column. For Н >, h 
approaches the limiting value of 5400 m, which means that for arbitrarily large 


heights of the column / remains finite. 


20.3 The Normal Distribution as the Limiting Value 
of the Binomial Distribution 


A coin is tossed 10 times in succession. The probability of the outcome being a head 
k times and a tail (10 — k) times is given by the binomial distribution for a random 
experiment carried out n times with two possible outcomes. 
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Р(10) 


0-2 


0: 1-2 3 4 5 6 7-8 9 M. 
Fig. 20.9 


0-31-2.8 4 
Fig. 20.10 


For n = 10 and equally likely outcomes, the probabilities have been plotted as 
a function of k in Fig. 20.9. 

Such a distribution can be reproduced experimentally in the following manner. 
n rows of nails are positioned on a board in the form of a pyramid, as shown in 
Fig. 20.10. Each nail is exactly positioned in the middle of two nails belonging to 
the next lower row. Such a board is known as a Galtonian board, and the figure 
illustrates two such boards, one with 4 rows of nails and the other with 8 rows 
of nails. 

A marble (or a small steel sphere) is introduced in the tunnel at the top of the 
board and allowed to drop freely so that it impinges on the top nail centrally. It is 
then deflected to the right or left with equal probability, after which it impinges on 
a nail in the next row and so on through every row. Each time it hits a nail it is 
deflected to the left or right with equal probability. The sphere is then collected in 
one of the partitions shown in the figure. 
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If there are n rows of nails, then this corresponds exactly to n experiments with 
two equally probable outcomes. This leads to the binomial distribution. A marble 
reaches the kth partition if it has been deflected on k nails to the left and (n — k) 
nails to the right. The probability P(n; k) for this (cf. Sect. 20.3.2) is 


nm -(30) O 


If we allow many marbles to drop through the Galtonian board, we then find that 
they are distributed in accordance with the probability P (n; k). The number of mar- 
bles in the different partitions approaches the binomial distribution. (From a prac- 
tical point of view, we have to select the radius of the marbles and the distance 
between the nails in such a way that ideal conditions are obtained.) 

Figure 20.11 shows results obtained empirically with n = 4 andn = 8. 

We сап, of course, increase the number of rows, n, and the number of partitions. 
Figure 20.12 shows results obtained empirically with n = 24. 

In the limit, as n — œ, we will obtain a continuous function, as shown in 
Fig. 20.13. It is known as the normal distribution. 

The proof that the binomial distribution is transformed into the normal distribu- 
tion for n — ee is quite complex and beyond the scope of this book. 


0 1 2 3 4 
Fig. 20.11 
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Fig. 20.12 
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Fig. 20.13 


20.3.1 Properties of the Normal Distribution 


The analytical expression for the normal distribution, given without proof is 


Itis symmetrical with respect to the origin of the coordinates. Because of its shape, it 
is frequently called the bell-shaped curve. Because of symmetry, it has its maximum 
at x = 0. 

The parameter o defines the particular shape of the normal distribution. This is 
illustrated in Fig. 20.14 foro = 1, 2 and 3. 

When o is small the curve is narrow and elongated, i.e. it has a sharp maximum. 
The larger c the flatter and broader the curve becomes. Whatever the value of o the 
area under the curve remains constant. 

The normal distribution is a probability distribution and has to satisfy the nor- 
malisation condition, i.e. the sum of all probabilities must be equal to 1. In this case, 


we have i 
ын ны 1/ x52 
X ax = f — e72) dx =1 
TIO we 


The proof is given in the appendix to this chapter. 


f(x) 


Fig. 20.14 
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If we consider the normal distribution as the probability distribution of a random 
variable x, we can then compute its mean value. We have 


Here the normal distributed random variable has zero as its mean value. 
A new normal distribution is obtained if we shift it along the x-axis by an 
amount u (Fig. 20.15). This is achieved by replacing x by (x — jz) so that 


This function, of course, has its maximum value at x = u. The mean value of the 
random variable in this case is X = u, which should be fairly obvious. 
The parameter o is called the standard deviation. It is given by 


о2-| (хш?) dx 


o determines the width of the normal distribution curve. It is a measure of the vari- 

ation of the random variable x about its mean value. Within the interval и +o and 

и — с lies 68% of the area beneath the normal distribution curve (Fig. 20.16). Its 

meaning will be discussed further in Chap. 21 which deals with the theory of errors. 
The normal distribution is symmetrical about its maximum value at x = p. 


f(x) 


Fig. 20.15 Fig. 20.16 
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20.3.2 Derivation of the Binomial Distribution 


There are some random experiments for which there are only two possible out- 
comes, e.g. tossing a coin. 

Let us denote the two possible outcomes of an experiment by A and B, and 
let P(A) and P(B) be the probabilities for the occurrence of events A and B, 
respectively. Since A and B are the only two elementary events in this random 
experiment, we must have 

Р(А)-Р(В)-1 


We now carry out this experiment N times in succession, e.g. we toss a coin N 
times. What is the probability of event A occurring Na times in N experiments? We 
call this probability P(N; Na). We are not concerned here with the order in which 
the Na events occur. 

The solution can be divided into three steps. 


1. We compute the probability for the occurrence of a particular outcome with NA 
events А and Ng events B, so that N = Na + Ng. 

2. We obtain the number of particular outcomes which differ only in arrangement, 
not in NA. This is the permutation discussed in Chap. 19, Sect. 19.3.1. 

3. We evaluate, from the probability of one particular outcome and the number of 
equally probable outcomes, the probability that out of N experiments Na will 
lead to the event A. 


Step 1: We pick out one particular outcome, perhaps the one for which event A 
occurs NA times, and after that event B occurs Ng = (N — Na) times, i.e. 


AAA...AA BBB...BB 
w m 


NA times Ng times 


The elementary event A has the probability P ( A). The compound probabil- 
ity for the occurrence of Na events A is 


P(NA) = P(A) x P(A) x- x P(A) = P(A)NA 


The elementary event B has the probability P(B). The compound proba- 
bility for the occurrence of Ng = N — Na events B is 


P(Ng) = P(B) x P(B)x---x P(B) = Р(В)\ = p(B)N-N^ 


The compound probability for the simultaneous occurrence of Na events А 
апа Ng = № — № events B is then the product 


P(A)"^ P(B)NB = P(A)P^ p(B)N Na 


We have thus determined the probability for the occurrence of one particular 
outcome of the test. 
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Step 2: In Chap. 19, Sect. 19.3.1, we determined that the number of permutations 
of N elements in which Na and Ng are equal is 


N! N! N 
Na!Npg!  Na!(N—Na)! — 13 
Step 3: Each of the permutations in Step 2 has the same probability of occurring. 
The probability that one of these permutations may occur is (in accordance 
with the addition theorem for probabilities) obtained by summing the prob- 
abilities of all the permutations. 


Hence, with the results obtained in Steps 1 and 2, the probability P(N; NA) that 
out of N random experiments NA will have the outcome A is given by 


P(N;Na) = WN xg M" Panos = ra P(A)Na P(B)N-Na 


This also means that out of N experiments those that remain, namely Np, will have 
the outcome B. For a preassigned №, P (A) and P (B), this probability is a function 
of Na. The range of definition of this function is the set of integers from 0 to №. For 
each Na, we can calculate the accompanying probability using the above equation. 
The totality of these probabilities is called the binomial distribution. 


Note: Repeating an experiment N times can be replaced by carrying out N equal 
experiments simultaneously, e.g. tossing one coin N times is equivalent to 
tossing N coins simultaneously. 


Summary 
Suppose an experiment has two possible outcomes, A and B, which occur with prob- 


abilities P(A) and P(B). If we carry out N such experiments, then the probability 
for the occurrence of Na events A is given by the binomial distribution 


P(N;NA)- ЇЕ ) РОА) P(B)N-Na 


Appendix 


Evaluation of the integral 


First substitute 
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Then Ш 
dx = V20 dz 


Hence the integral becomes 


1 оо 
Iw]. dz 


To integrate this expression we use a 'trick': we multiply the expressions 


I es 1 ee 
1 = Ис dz and I= 227 4и 


So obtaining the double integral 


p.c (8 e Gne) dz du 
Te pte РЕҢ 


Now we introduce polar coordinates: 


rl, 2 —rcosQ 
u=rsing 


The differential area dz du becomes r dr d$. We integrate ф from 0 to 27 and г 


from 0 to ce and obtain 
1 оо (2m 
P= J y er dr dọ 
л Jo Jo 


The integration with respect to ф gives 
2л 
аф = 2л 
0 


so that Bao re? dr 
0 


The solution can be verified easily 


(The value J = —1 is ruled out, since the integrand assumes positive values only.) 
If we now insert in the integral и = 0 and o = 1/ V2 we obtain 


oo 


1 оо 
mie dx =1 or [oe dx = ут 
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Evaluation of the integral 


1 oo 
nm 
ON 2m J->% 


l/íx— 5 
8462) 
2 o 


Then (x — u)? = 20222 and dx = V20 dz. 


First substitute 


The integral 


1 z _l(x-u 
е E 25 ) dx 


is transformed into 


Integrating by parts gives 


DG? qu gs 9 20? 1 2] Буи 1 32 
mE = d БЭЭР EV ae -2 -Zf EM d 
Vda ect cue " 


From above we know that 


Hence 


Exercises 


1. Two dice are thrown. Calculate the mean value of the random variable ‘sum of 
the number of spots’. 


2. A random variable has the probability distribution 


E 0<х<2 
Р(х) = 4 2 
0, 


otherwise 


Compute the mean value of the random variable. 
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3. 60% of the students who start to study for an engineering degree complete their 
studies and obtain a degree. What is the probability that in a group of 10 arbi- 
trarily chosen students in the first term of study 8 will obtain a degree? 


4. Evaluate the mean values of the random variable x which have the following 
normal distributions: 


1 2 1 2 
= -(x-2)^/18 m —(x+4)?/2 
(a) f(x) RU e (b) f(x) a e 


Chapter 21 
Theory of Errors 


21.1 Purpose of the Theory of Errors 


The theory of errors is a part of mathematical statistics and deals with the follow- 
ing facts. 

Given the results of measurements carried out in a laboratory, we require state- 
ments about the ‘true’ value of the measured quantity and a prediction of the accu- 
racy of the measurements. 

There are two types of errors which arise when we carry out a measurement: 
systematic or constant errors and random errors. 

Constant errors are errors generated in the measuring instruments or in the 
method of measurement. They always bias the result in a particular direction so 
that it is either too large or too small; they arise through wrong calibration of the 
measuring instrument or not paying attention to secondary effects. An example of 
a constant error is often found in the speedometer of a car, sometimes as a result of 
design. The speed indicated is frequently found to be 5% above the ‘true’ speed of 
the car, but this can vary from 0 to 7% in practice. 

Constant errors can only be avoided by a critical analysis of the measuring tech- 
nique and of the instruments, and such errors cannot be discovered with the help of 
the theory of errors. 

Random errors are due to interference during measurements, so that a repetition 
of measurements does not give exactly the same results, i.e. the measured values 
vary. For example, if we weigh a body repeatedly we will always obtain a differ- 
ent result. Although we take great care on each occasion, we are not able to read 
each time exactly the same position of a pointer between two very fine marks. Fur- 
thermore, the pointer itself does not always settle at the same position. Random 
errors are the result of a multiplicity of interference factors like the fluctuation of 
the boundary conditions which had initially been assumed to be controllable (tem- 
perature, air pressure, voltage fluctuations, shocks and errors of observation). 

To avoid random errors we must naturally improve the method of measurement. 
This leads to more reliable measurements but does not solve the fundamental prob- 
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lem. The influence of random errors can be limited, and the accuracy of measure- 
ment may even be improved in powers of 10; however, each instrument has limited 
accuracy and hence random errors will always appear. 

The purpose of the theory of errors can now be formulated more precisely. 

From the measured values we want to be able to infer the ‘true’ value of the 
measured quantity and estimate the reliability of the measurement. Each reading in 
an experiment is made up of a hypothetical ‘true’ value of the measured quantity 
and an error component: 

x=T+E 


where x = measured value, T = ‘true’ value free of errors, and E = error compo- 
nent. Furthermore, 
Е = Еу + Ез 


where E, = random error which can be estimated by repeated measurements, and 
Ез = constant error. 


21.2 Mean Value and Variance 


21.2.1 Mean Value 


The acceleration due to gravity, g, is to be determined experimentally. The time 
of fall of a sphere is measured with a stopwatch and the distance with a tape mea- 
sure. In order to increase the reliability, the measurements are repeated in a series 
of readings. A series of readings comprising 20 measurements is considered to be 
a random sample of all possible measurements for this experimental set-up. The 
arithmetic mean value of n measurements is taken as the best estimate for the ‘true’ 
value. We have х 

È х} 

7-1 


If individual measured values occur repeatedly, the mean value can be expressed in 
terms of the frequency h; with which they occur, in which case we have 


х= 


S|— 


k 
x= У hi xj 
i=l 


where n; is the frequency and h; = n; /n is the relative frequency of the measured 
value x;. If n — ©, the relative frequencies become the probabilities P(i) (see 
Chap. 19, Sect. 19.2). 

The sum of all deviations from the arithmetic mean value vanishes, i.e. 
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21.2.2 Variance and Standard Deviation 


Individual measurements deviate from the mean value partly because of random er- 
rors. These deviations become smaller as the measurements become more reliable 
and more exact. They thus enable us to predict the reliability of the measurements 
and the magnitude of the random errors. In order to draw this conclusion, we have 
to define a measure of dispersion. Let us start by considering the deviation of the in- 
dividual measurements from the mean. Since positive and negative deviations from 
the mean cancel out, their sum is zero. However, to obtain positive values we square 
the deviations. A suitable measure of dispersion is obtained by taking the mean 
value of the square of the deviations. Such a measure is called variance. 


Definition The variance is the mean value of the square of the deviation: 
142 2 
S? = 1 У(хс-х) (21.1) 


i=1 
If a frequency distribution exists, then the variance becomes 
2 k 2 
St = У hj (xj ын X) 
7-1 


The unit of variance is the square of a physical quantity. 
The measure of dispersion of the physical quantity is obtained by taking the 
square root of the variance and is called the standard deviation. 


Definition Standard deviation: 


(21.2) 


Meaning of Standard Deviation 


For a large number of measurements, about 68% of all measured values of x will lie 
within the interval 
X-S<x<x+S 


The mean value and the variance are related to each other. We demonstrate below 
that for the arithmetic mean the variance (and with it the standard deviation) assume 
a minimum value. 


Proof We take the variance as the mean value of the squares of the deviations from 
some independent reference value X so that 
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We wish to determine X in such a way that the variance assumes a minimum. Ac- 
cording to the rules of differential calculus, we find the minimum if we set 


d 
—(S?)=0 
as 
.e. — |- j—X = ; —X)(—1) = 0 
ie E (iLe) =2 0-9-0 
ог Ух; - п = 0 
1 
Hence Х = – Ух; = X , the arithmetic mean value . 
n 


Now we have to check whether there is, in fact, a minimum for this value of x. 
We find 


— (S?) =2, a positive value . 


Hence the variance is a minimum for X = Х. 


21.2.3 Mean Value and Variance in a Random Sample 
and Parent Population 


A random sample contains n measurements of one quantity. We consider the mea- 
surements as a random selection out of the set of all the possible measurements 
in an experiment. This set is called the parent population. It is always larger than 
the random sample. The parent population, like the random sample, is characterised 
by a mean value and a variance. The mean value for the parent population is the 
hypothetical ‘true’ value. The values which relate to the parent population can be 
estimated on the basis of random sample data. The larger the random sample is, the 
more reliable the estimate becomes. 
Quantities which relate to the random sample are denoted by Latin letters, i.e. 


X = mean value 


S? = variance 
and those which relate to the parent population are denoted by Greek symbols, i.e. 


u = mean value 
2 


0^ = variance 
The estimates for the ‘true’ values for the parent population are the ones we are 
interested in, and the results of measurements are estimates of the unknown ‘true’ 
values. We give below, without proof, the most important formulae for these esti- 
mates. You will find detailed explanations and proofs in textbooks on mathematical 
statistics. 
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Best estimate of the arithmetic mean value 


214 
их = = Уух) (21.3a) 
1-1 
Best estimate of the variance 
CE NC NUS TD 
c aS = У, (9 x) (21.3b) 


i=1 


This estimate for the parent population is larger than that for the sample by the 
factor Gop: for sufficiently large n this factor tends to the value 1, and hence S? 


can be used as an estimate for o?. 


Best estimate of the standard deviation 
n n n 
= S = -—X 2 
j Ta (do 3) 


Example The diameter of a wire has been measured a number of times, as shown 
in the first column of the table below. Calculate the mean value and the standard 
deviation for the parent population. Columns 2 and 3 show the required workings. 


Diameter (x; — X) (xj = X)? 

xj (mm) (mm) (mm)? 
14.1 x 102 —0.1 x 1072 0.01 x 10-4 
13.8 x 1072 —0.4 x 1072 0.16 x 1074 
14.3 x 1072 0.1 x 1072 0.01 x 1074 
14.2 x 1072 0 0 
14.5 x 10-2 0.3 x 1072 0.09 x 1074 
14.1 x 1072 —0.1 x 1072 0.01 x 1074 
14.2 x 1072 0 0 
14.4 x 10-2 0.2 x 1072 0.04 x 1074 
14.3 x 1072 0.1 x 1072 0.01 x 10-4 
13.9 x 1072 —0.3 x 1072 0.09 x 10-4 
14.4 x 102 0.2 x 10-2 0.04 x 1074 

Sum 156.2 x 1072 0 0.46 x 10-4 
1. We calculate the mean value of the sample: 
;. 1562x102 
x = d = ——— = 0.142mm 


11 
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2. Now we calculate the variance and standard deviation of the sample: 


|. 0.46 x 107“ 
B 11 
S = 0.20 х 1072 = 0.002mm 


S? = 0.042 x 1074 тт2 


3. Mean value, variance and standard deviation for the parent population: 


х = 0.142mm, as for the sample . 


11 
o2 S2 = — x 0.46 x 1074 = 0.046 x 1075 
п-1 10 


o = 0.21 x 1072 = 0.0021 mm 


21.3 Mean Value and Variance of Continuous Distributions 


The concepts of variance and mean value can be applied to continuous distributions. 
Let the probability density function p = f (x) of a distribution be given. 
We know that for discrete samples the mean value is 


We replace the discrete frequencies h ; by the probability density function p = f (x) 
and proceed to the limit as k — со, The sum becomes an integral: 


Mean value u of a continuous distribution 


u-[ xfQ) dx (21.4) 


The variance for a given frequency distribution of the sample gives the best esti- 
mate of the variance for the parent population. We have seen that it is given by 


n 
o? = У hx - xy 


In the limit, the variance for a continuous distribution becomes 


а? = Js (x — u)? f (x) dx (21.5) 


Example A factory manufactures bolts for use in the construction industry. Be- 
cause manufacturing processes are not perfect not all manufactured parts are abso- 
lutely identical. In the case of the bolts, their diameter x is a random variable. We 
assume that the density function has the following distribution: 
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ғо) a 031.17), 09«x«11 
; otherwise 
The constant A is unknown. 
Determine the value of the constant A, plot f (x) graphically and calculate the 
mean value jz and the variance o?. 
First let us calculate A. 
From Sect. 20.1.2 we know that all probability distributions have to satisfy the 


normalisation condition, i.e. the sum of all probabilities has to equal 1. Hence 


11 
Р(х) ах =1 
0.9 
11 1 
ie. | (x —0.9)(1.1—x)dx = — 
0.9 A 


Integrating and solving for A gives A — 750. 

Figure 21.1 shows tabulated values of the function and its graph. 

The function is symmetrical; in fact it is a parabola. We see from the graph that 
ш must be equal to 1. Let us check this with our definition: 


1.1 
u= 750 | x(x —0.9)(1.1 — x) dx 
0.9 


1 poe)" 
x4 x2 


=1.0 
4 2 9.9 


2 3 
= 750 
B 


The variance is o? = 750 (14 (x — 1)?(x — 0.9) (1.1 — x) dx = 0.002 


x f (x) f(x) 
8-0 

0.90 0.0 

0.92 2:7 

0.94 4.8 6-0 

0.96 6.3 

0.98 7.2 4.0 

1.00 7.5 

1.02 3:2 

1.04 6.3 2:0 

1.06 4.8 

1.08 2.7 d 

1.10 0.0 0-2 0-4 0-6 0-8 1-0 1.2 x 
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21.4 Error in Mean Value 


Up to now we have considered the mean value of one random sample of n mea- 
surements to be the best estimate for the ‘true’ value. We have not yet answered the 
question concerning the error in such an estimate. 

Consider now several random samples (series of readings) of the same number 
n in the same parent population. The mean values of these random samples are 
also scattered around the ‘true’ value. We need to know whether the dispersion of 
the mean values is smaller than that of the individual values. This dispersion of 
the mean is most important to us, since it determines the reliability of the result of 
a series of readings. 

Let ow? be the variance of the mean values of random samples, the variance of 
the individual values being o?. 

2 o? 


The variance of the mean values is then given by ом“ = um: 


The standard deviation of the mean values is Ом = Уй 
The standard deviation of the mean values is a measure of the accuracy of the 
mean values of a series of readings. It is referred to as the sampling error or the 


mean error of the mean values. 


Definition The standard deviation of the mean value, the sampling error 
(mean error of the mean value) is 


(21.6) 


The accuracy of the measurements can be increased by increasing the number of 
independent measurements. If the sampling error in the mean value is to be halved, 
for example, then the number of measurements has to be quadrupled. 


Example The example in Sect. 21.2.3 concerning the diameter of a wire gave the 
following results: 


Arithmetic mean value x = 0.142mm. 
Standard deviation o = 0.0021 mm. 


Mean error of the mean values: 


0021 
Rie ШИН аврахын 


vn Уй 


Hence d = 0.142 + 0.0006 mm. 


Confidence Intervals 


Although the mean value x obtained is an estimate of the ‘true’ value џи, we also 
need to take into account the sampling error. 
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Since it is reasonable to suppose that the sample means are normally distributed, 
we can expect the ‘true’ value to lie with a probability of 68% in the interval X + low 
and with a probability of 95% in the interval x + 20у (see Sect. 21.5). 

These intervals are called confidence intervals. The results of measurements are 
usually presented by quoting the mean value and the sampling error. 


21.5 Normal Distribution: Distribution of Random Errors 


We have assumed that random errors are normally distributed. This assumption can 
be rendered plausible on the basis that random errors arise as a result of the super- 
position of many very small sources of errors referred to as elementary errors. The 
individual errors in measurements arise as a result of these elementary accidental 
errors like the deviations of the marbles in the Galtonian board. Starting with the 
hypothesis of a large number of uncontrollable statistical interference factors, Gauss 
showed that the probability function of the distribution of measured values can be 
described by the normal distribution: 
-— 20 


f(x) = aX E2 


The usefulness of this model has also been proved empirically. In many cases, 
the errors in measurements are actually distributed in accordance with this normal 
distribution around their corresponding mean value. The larger the number of mea- 
surements the more closely the distribution approaches the Gaussian normal dis- 
tribution. The measurements are dispersed around the ‘true’ value u. The standard 
deviation is o. 

Using the value of the standard deviation, we can specify the percentage of the 
measured values which can be expected to fall within a given interval in the neigh- 
bourhood of the mean value, as shown in Fig. 21.2. 

A deviation of more than +30 from the mean value can only, on the average, oc- 
cur once in 300 measurements by chance. This means that practically all measured 
results fall inside the interval u + 3o. 

The mean values of random samples are always normally distributed, the dis- 
persion being of course smaller. As already explained in Sect. 21.4, the variance 
of the mean values decreases as n, the number of measured values, is increased 
since 


(standard deviation of the mean value) 


Oo 
OM = — 
MU уп 
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Fig. 21.2 


21.6 Law of Error Propagation 


Often we have to distinguish between fundamental quantities (length, mass, time) 
and derived quantities (specific weight, velocity, acceleration, force, pressure etc.). 
Many physical quantities cannot be measured directly, since they are calculated from 
several other measured quantities. For example, specific weight is calculated from 
the measured quantities weight and volume. Thus the sampling error in the specific 
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weight is made up of the sampling errors which arise in the measurement of the 
weight and the volume. 

If we assume that some physical quantity g cannot be measured directly but is 
a function of two measurable quantities x and y we have 


g=f (x,y), x and y being independent. 


We can then measure x and y and obtain the mean values x and y and the stan- 
dard deviations бмх, ому. The standard deviation оме of the quantity g is then 
calculated by the application of the Gaussian /aw of error propagation given here 
without proof. 


Law of error propagation 


9 гү? 9 гү? 
ом. = TOS 0M, ? + (52) ом,2 (21.7) 


The partial derivatives are calculated at the values x and y. 


Example A coil having 500 windings is made of copper wire having a diameter 
D = 0.142 + 0.0006 mm and a total length L = 94290 + 30mm. The specific resis- 
tance of copper is 


p=1.7x10°Qmm 


If A is the cross sectional area of the wire, then the total resistance R of the coil is 


L 401 
R-P47 ар? 
_ 4x 1.7 x 1075 x 9.4290 


= 101.20 


л (0.142)2 


We need to know the standard deviation омв of this total resistance. We have 


OR 4p 


— = SS = -3 
ЭГ” zpi-932x10 
OR 801. 
EE nn qas 
ор л 3 28 


Thus 


Our = (9.32 x 10-3 x 30)? + (1425.6 x 6 x 10-52 x 10 


Hence the total resistance of the coil is А = (101+1)Q 
This value would then be quoted by the manufacturer. 
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21.7 Weighted Average 


It frequently happens that a physical quantity can be determined by two different 
measuring methods. The results of the measurements usually deviate from each 
other. We then have two results whose true value lies with a probability of 0.68 
in the confidence intervals 


X1 OM, and X2 OM, 
A better estimate for the ‘true’ value is obtained if we combine both measurements. 
We do not calculate the arithmetical mean value of the individual values but *weigh' 
the measurements. A simple arithmetic average would not give the best value be- 
cause it would mean that we were giving equal importance to both values. 
The measurement with the smaller sampling error has a larger weight. The fol- 
lowing expression is called the weight. 


The weighted average (weighted mean value) is given by 
81Х1 t 82%2 
§1 — 82 


It can be shown that in this expression the sampling error of x has a minimum. The 
expression can be generalised for more than two readings. 


wa 


Weighted mean value: x= ie i e2a 
81+ 82 
А 1 
Weight: 8 = —5 
OM; 


Example The diameter of the wire in the example in Sect. 21.2.3 is measured by 
a second method with the following results: 


d, = 0.1420 + 0.0006 mm 
də = 0.1410 + 0.001 mm 


The weighted mean value is obtained in the following way: 


1 1 
= = = 2.778 x 106 
581752 (6x 10-4)? 5 


1 1 
= —~-—____ ~ 106 
$275 (1039 


6 6 
Hence d 2.778 x 10? x 0.142 4- 10? x 0.141 —0.1417mm 
3.778 x 106 
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21.8 Curve Fitting: Method of Least Squares, Regression Line 


We have so far considered measurements which were related to one physical quan- 
tity. We will now consider experiments which investigate the relationships between 
two physical quantities, x and y, where the quantity y is measured and the quan- 
tity x is varied. 


Example The graph of the cooling curve for a liquid has to be determined. 

We measure the temperature at intervals of 30 seconds and plot a graph of tem- 
perature versus time. The experiment is repeated n times and we then calculate the 
mean value and the standard deviation at each time interval and plot these, as shown 
in Fig. 21.3. 

We now draw the best curve through the measured values, i.e. the curve which is 
as close as possible to the measured values. This method is called curve fitting. 

To obtain the best fit we use the method of least squares. In this method the 
squares of the distances of the measured points from the curve are a minimum. 
Before we can begin, however, we must decide on the type of function that will best 
fit the experimental data. In general; this function will depend on the underlying 
theory. The types of functions most frequently used are straight lines, log functions, 
parabolas and exponential functions. 

We propose to demonstrate the method using a straight line relationship. The task 
is to determine, from the given data, the coefficients a and b in the equation 


y=ax+b 


assuming it to be a first approximation to the experimental data. It is called 
a regression line. 


aa 
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Let y=ax+b 
n 
У xiyi - nxy 
Then a= and b=f-ax (21.8) 
Y xi2-nx? 
1-1 


The straight line passes through the point (X, y) which is the ‘centroid’ of the 
test data. 


Proof 


Px, y) 


Fig. 21.4 


The vertical distance d; from the test point P; (x; y; ) to the straight line is 
dj = |у — (ax;  b)| = |y; — axi — b| 
Summing up the square of the distances for all the points, і = 1, 2, 3, ...,n, gives 


S 2 Y dj? => (yi - axi - b 


We now consider a and b as variable quantities and determine their values such that 
S will be a minimum. 
For S to be a minimum we must set 


ða 93b 
Hence 2 =0 = Y2(y; - ax; - b)(-xi) 
0 — Y iyi — axi? — bxi) = xiyi -аУ xi? - Y bxi 
os 
3p uem Y2(yi - axi - b)(-1) 


= Y (y: -axi -b)- Уу -a $ xi — bn 


By definition of the arithmetic mean value: У х; = nx and Y у; = ny 
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We thus have the two following equations in the two unknowns a and b: 


У х;у: = а У х2 пх =0 


ny —anx – Ьп = 0 (21.9) 
Substituting and solving gives 
b = ӯ-ах 
а 228i — nx (21.10) 
Yx;j2-nx? 


Thus we have determined the coefficients a and b of the regression line. 
The straight line y — ax 4- b is the best fit for the experimental data points. 


Example The values of the temperature during a cooling experiment of a liquid at 
intervals of 0.5 min are as follows: 


Time, t Temperature, T 
(min) (°C) 
0 62 
0.5 55 

1 48 
1.5 46 

2 42 
2.5 39 

3 37 
3.5 36 

4 35 


As a crude approximation, we propose to fit a straight line, i.e. T = at +b. 
To obtain the values of a and b according to the method of least squares, we 
proceed in accordance with the following scheme. 


1. Tabulate the test data and calculate the necessary sums and products according 
to (21.10), as shown in the table. 
2. Calculate the mean value f and T: 


t=2min, T = 44.44°C 
3. We substitute the values in the equations for a and b. This gives 


YX5T;-niT 703—9x2x 44.44 
a= = 


хант  51—9x22 
= —6.47°C/min 
and b = T — af = 44.444 6.47 x2 


= 57.4°C 
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ti 15 Т, t; T; 
(min) (min?) (*C) 
0 0 62 0 
0.5 0.25 55 27.5 
1 1 48 48 
1.5 2.25 46 69 
2 4 42 84 
2.5 6.25 39 97.5 
3 9 37 111 
3.5 12.25 36 126 
4 16 35 140 
by 18.0 51.0 400 703 
SM Regression line 
60-1 T = (-641t + 57.4)°C 


— L1... 
40 : Lr IE, 


Fig. 21.5 


Hence the regression curve is 


T = (—6.47t + 57.4)°C 
It is shown in Fig. 21.5. 


21.9 Correlation and Correlation Coefficient 


Let us consider two sets of measurements of two variables x and y which are graph- 
ically represented in Fig. 21.6. The first impression is that the variables x and y are 
correlated. In fact, if we calculate the regression lines they both lead to the same 
line. But it is obvious that the first set, A, suggests a stronger dependence between 
the variables than the second set, B. In the first set, we might assume a clear-cut 
functional dependence between x and y which is distorted by random errors of 
measurement. 

In the second set of measurements, we might say that there is a correlation be- 
tween x and y, but there might be other variables too which influence y. Empirical 
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Our task is to find a measure which describes the magnitude of the correlation 
between two variables, while a variance is superimposed, the details of which we 


data of this kind often occur in biology, psychology, meteorology, and economics, 
do not know. 


but they also occur in technology and physics. 


Fig. 21.6 
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This measure is given by the correlation r?, or the correlation coefficient r. 
They are defined by 

2 

(z хуг —- nx 7) 
Correlation r= 
(z? Е 23 (зэ? = 23 
Correlation DXi yi — NX 
coefficient dm 
(хх Е nx?) (зэ? = ny?) 


where x;, y; = measured values, and y, x = mean values. All sums are formed 
from i = 1 toi =n. n = number of measurements. 


Note that the correlation coefficient can take on values between —1 and +1. We 
get r = 1 if all points (x;, yj) lie on a straight line with a positive slope, i.e. the 
regression line. 

We get r = 0 if x; and y; are statistically independent, e.g. if the dots lie in 
a circular-shaped cluster. 

r is negative if the regression line has a negative slope, indicating that when x 
increases y decreases. 

The definition of the correlation coefficient may seem quite arbitrary. It may be 
useful to know how we arrive at this concept. 

We established the regression line by means of the condition that the sum of 
the differences between y;-values and the adjunct points of the regression line ў; 
should be a minimum. There remains a variance 1/n У, (y; — f;)? which is only 
then zero if all y;-values coincide with the regression line. Consider the variance of 
the y;-values with respect to the mean value y. It is obviously greater. 

A measure which describes the quality of the regression line as an approximation 
of the given values is the relative reduction of variance achieved by the regression 
line. The amount of the reduction of variance is called the explained variance. The 
fraction of variance thus explained relating to the regression line with the original 
variance relating to the mean value is called the correlation r?. Its root is the corre- 
lation coefficient r (often called the product moment correlation). 

The correlation can be obtained in the following way. 

Let the variance with respect to the mean value be 


The remaining variance with respect to the regression line is 


У (у= Fi)? 
n 


Se= 
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Thus the explained variance is 
Se = UM — Vr 
The correlation is given by 
2_ Se _ EQi 30 -XGi - 91)? 
5м У (уг = Ӯ)? 


If the formula for the regression line is inserted, it can be shown with an elementary 
but cumbersome calculation that the given formula holds true. 


Exercises 


1. In the following examples, state whether the error is systematic (constant) S or 
random А. 


(al) A 100-metre race is held in a school during a sports day. The judges start 
their stopwatches when the sound of the starting pistol reaches them. What 
type of error arises in this case? 

(a2) The timing of the start and end of the 100-metre race is subject to individ- 
ual fluctuations, e.g. reaction time. 

(b) The zero point of a voltmeter has been wrongly set. The measurements аге 
therefore subject to an error. What kind of error is it? 

(c) The resistance of a copper coil is obtained by measuring the current flow- 
ing through when a voltage is applied to it. As the coil warms up the resis- 
tance increases. What kind of error will ensue? 


2. (a) Nine different rock samples are taken from a crater on the moon whose 
densities are then determined with the following results: 


3.6, 3.3, 3.2, 3.0, 3.2, 3.1, 3.0, 3.1, 3.3g/cm? . 


Calculate the mean value and standard deviation of the parent population. 
(b) The velocity of a body travelling along a straight line is measured 10 times. 
The results are 


1.30, 1.27. 1.32, 1.25, 1.26, 1.29, 1.31, 1.23, 1.33, 1.24m/s. 


Calculate the mean value and the standard deviation. 


3. A continuous random variable has the following density function: 


rady 0<x<1 


otherwise 


Calculate the mean value and the variance. 
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Determine the confidence intervals Х + ow and х + 20у for exercises 2(a) 
and (b). 


. A measured variable is normally distributed with a mean value u = 8 and a stan- 


dard deviation ос = 1. What percentage of all test data are smaller than 7? 


. (a) The sides of a rectangle are x = 120+ 0.2cm and y = 90+0.1cm. Calculate 


the area and the standard deviation. 
(b) Calculate the density and the standard deviation of a sphere of diameter 
6.2+ 0.1 mm and mass 1000+ 0.1 g. 


. The sensitivity of a spring balance is to be determined. To achieve this we place 


different masses m on the balance and record the deflection S. The results are 
mass (mg): 2000 3000 4000 5000 6000 
deflection (mm) 16 27 32 35 40 


If a straight line is to be fitted through these data points, i.e. 5 = am + b, calcu- 
late the values of a and b. 


. An angle has been measured several times with two theodolites and the fol- 


lowing values were obtained: 73?2/7" + 10' and 73?2'12" + 20". Calculate the 
weighted average. 


Answers 


Chapter 1 3. 


1. Vectors are: acceleration, centripetal force, 
velocity, momentum, magnetic intensity. 


2. The order in which the vectors are added is 
immaterial. 


7 


Fig. 1.37 


o M 


Fig. 1.40 
(b) 


yÁ 


» 
x 


а, 


Fig. 1.42 


Answers 


(b) 
УА 


Fig. 1.43 


6. (a) |ap|= |a| cos 60° =5x } = 2:5 


12. 


13. 


(b) |ap|=0 
(c) |ap|=4 
(4) |ap|= |3 cos(x — F)| 
=|3(-cos $)| = 2 
P—(10,—2) Steps in the computation: 
— 
a = Pı P2 = P2 -P1 (5, 2) 
b = P3 -P1 = (3, —5) 
Ра =P; +a +b = (10, —2) 
— 
a = РР = (x2 =- X1, y2- X1). 
b=(x3- х2, Ya — y2), 
€ —(X4— х3, Y4— Уз), 
d —(x1—X4, Y1— ya), 
S —(x5 X] X3 — X2 + X4 — X3 
TX) = х4, 
У2- У1 + Уз – У2 + уд – Уз 
+y1 — y4) 
=(0,0)=0 
Е = (90N, 10N) 
(a) (4, 3, 0) (b) (5, 3, 7) 
(а) а= (-144, -1, 0) 
(b) a= (-7, 22, 12) 
_3 -1 2 
(8) ea = Tre aa? ia 
(b) ea = 2, – 5, -3 
(а) |а| = v24 = 
| 
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14. (а) Vi + V2 = (0,250) km/h 8. lax b| - ab sina 
(b) V4 + V3 = (50,300) km/h 5 
(c) Vi+ V4 = (0,350) km/h (a) 2x 3x 5 25.196 
(d) |V1 4- V2| 2 250km/h (b) 0 (c) 6 
(е) |V1 + V3| = 304 km/h 
(f) [V4 + V4| = 350km/h 9. (a) -3с (b) 3b (c) -$b 
(d) —6a (e) 0 (f) 6a 
10. e = (aybz —azby, azby — axbz, 
Chapter 2 dxPy аур) 
(a) e — (10, —9, 7) 
1 (b) c = (3, 6, —9) 
1. (а) a-b=ab cosa —3x2x 5 = 3 
(b) a- b — 10 
(c) a.b 22x y2 ~ 2.828 
d) a-b = —3.75 
e Chapter 3 
2. (a а= 5 Vectors are perpendic- 
ular 
(D «-0 Vectors are parallel 1. (a) 
(с) а= 5 
(d 5 <а<л 
3. (а) a.b = 3—2 +20 = 15 
(b) a.b = —1.25 
(c) а-5--1 
(4) a-b=4 


4. (a) a-b=0 Vectors are perpendic- 


5. 


6. 


ular to each other or one vector- 

is zero 

a:b ——12 ais not perpendic- 
ular to b 

a-b=0 a is perpendicular to b 


(b) 
(c) 


(d a.b —0 ais also perpendic- 
ular to b 

(e) a-b ——1 ais not perpendic- 
ular to b 

(0) a-b —0  aisalso perpendic- 
ular to b 

cos @ = ab 2. 

(а) cos œ ——1 thusa=z 

(b) cos a = Е thus œ ~ 48° 

О =Е.5 

(а) 15 Мт (6) 5Nm (с) 0 


(а) с is parallel to the Z-axis 
(b) c is parallel to the x-axis 


(b) 


(a) 


Zeros: x ——1,x =3 
Poles: none 
Asymptote: none 
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(b) 


Zeros: none 
Poles: x =0 
Asymptote: x-axis 


-2 4 


Zeros: none 
Poles: none 
Asymptote: x-axis 


(d) 


9. (a) sin (u-- 7) (b) їл 


2 
1 
Zeros: none (c) 27 
Poles: x = 0 
Asymptote: y = x (This is the streight 10, (a) sin79° (b) sin3° 
line. It intersects the first and third e (mL . f1 
quadrant.) (c) sin( 7.) (d) ч (65) 
3. (a) 0.017 (b) 2.09 
(c) 0.785 (d) 7.19 1. @.1 (b) tang 
(c) — tan? o d 2 
4. (a) 5.739 (b) 102.56° ое 
(с) 12.61° (d) 130.06? А 
(е) 54.43° (f) 180° D. (a) 2,510 01 cos w2 эээ 


2sin@, COs 02 


95 
(a) (b) 2cos45° cosa = V2cosa 
2 
cos^ Q 1 
NES er CEU DTE t 
(© 2singcos@ аб 
3 3 
13. sin | sin Z = 23 2sin 7 cos 7 = 2 


Answers 
Chapter 4 
L@t æ 
a 
(d) 1 (e) 
(g) 10? (b) 
2. (a) V2 (b) 
(d) v35 (e) 
3. (a) 2 (b) 
(d) 6 (e) 
4. (a) 3 (b) 
(d) ад (e) 
(ga (8) 
5. (а) e (b) 
(d) 1 (e) 
6. (a) x (b) 
(4) 18(/4) (e) 
7. (а) Ina+Inb 
(c) 6 
(e) 8x 
(oy te 5 
9. (а) у= (x—-1)? 
() у-х2-1 
(е) f(g(2))=2 
Chapter 5 
1l (90 b) ; 
Q1! o5 
(g) 6 
2. (a) —1 ОЕ 
(d) 0 (e) 1 


(c) 
(f) 


(c) 
(f) 


(c) 
( 1 


(c) 
(f) 


(c) 
(f) 


(c) 
(f) 


(b) 2lgx 
(d) Sldx 
( x-3 


b) y = 


x241 
х2-1 


9 (80) -5 


 f(g(1)--1 


(b y= 


(c) -1 


(t) 2 


(c) 5 
(f) 0 
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3. (a) At x = 0 the function is continuous 
but not differentiable. It is shown in 
Fig. 5.40. 


Fig. 5.40 


(b) 


The function is shown in Fig. 5.41. 
It is discontinuous at the points x — 
1; 2; 3;... 


Fig. 5.41 


(c) The function is discontinuous at the 


points x — 2, 4, 6, 8,... 


3 4 5 6 17 
4. (а) 55 сай QN QU AT 60 
z 7.28 
1— (1/2)8 511 
b =3 = 
(omg ea. 2512 
ғ 5.988 
1 
S =3x —-—6 
(c) x 1/2 
5. (a) End points of the secant: P4 (1, — 1), 
Р 3 3 
TUUS 
Ay 
SI f th t m, = — —2.75 
ope of the secant: ms = 77 
Slope of the tangent: тт = у'(1) = 1 
d 
(b v(t) = ČŽ =6t -8 , v(3) = 10m/s 
(с) 0) dy = (2x4 7) dx 


(ii) dy = (5x* — 8x?) dx 
(iii) dy = 4x dx 


10. 


11. 


(a) 15x4 (b) 8 
(c) Цан (d 21х2-6ү/х 
х2-2 
(a) 6x? (b) шиль 
3Vx2 
2 8 
© x? @ (4+ x)? 
1 
(е) 6х(х2+2)2 (f Ax? — => 


© ——— ELI 
8 3/1 +x2 x2 x 
—18 sin 6x (b) 8л cos (27x) 
Ae * [2л cos (2z x) — sin (27 x)] 

1 2 2 
—— (e) cos^ x —sin^ x 
x41 
2x cos x? (g 12x(3x? +2) 
ab cos(bx+c) 0) бх? е(2х2—4) 


(а) y' 2 0.1 С cosh (0.1x) 

(b) u’ = л [1 — tanh? (v + 1)] 

(c) n’ = tanh £ 

(а) s = tanh £ 

(e) y «0 (since y = —1 = constant) 
(f) y' 22 coth x (1— x coth x) 


ДУУЛЬ эхин es 
1+ 100x2 
C 
Bru Ee ЕЕ 
сун 02 +20 
he 
(c) 7) Tut 
(d) y' : 


x44 x?(x — 1)? 


12. 


13. 


14. 
15. 


Answers 
(a) a cos 6 + — 
a) a cos a 
cos? $ 
(b) e" (2 4- wu) 


1 
(c) y 
(d) 120x 


(а) x1 =2 
X2 = —2 
Хз = Х4= 0 
(фух-Кл, 


(У2, —8), minimum 
(0, 0), maximum 
(V2, —8), minimum 


2 9л : 
27572572 ,...maximum 


llr ша: 
х “Бин EO minimum 
(c) x =2zk , 


=E; 


97 ,... maximum 


Yi 


llz,...minimum 


Зл, 
(4) х = 54, попе 
(e) x = 2k +1 


-2 
2 ? 
x = 2kz — 2, maximum 


(k — 0, 1, 2322) 5 


x = (2k 4- 1)z — 2, minimum 


1 
(f) x1 — 4.85, (A 34 , maximum 


хо = 1.85, (3, —18), minimum 


Хз = 0, 


(а) у= 11 


(b) y = —8.89 


Answers 


Fig. 5.42 


16. (a) 2.44%, 1.63% 


17. (а) 5.27 


(b) 14.14 in magnitude 


18. (a) 1 
ded 
(D 73 


(b) 0 


(b) 12.95m 


1 1 
Hint: - 
x ( sinhx tanhx 


(In x)/cot(zr /2)x 


(e) e 2/7 
Hint: x (1/2) =e 
b? — a? 
0 
(b () —; 
—2x 
19. маа 
9:(8) y= 
9х2у2-4-сов 
(b) y! сан = Е y 
6x?^y—x sin y 
2(x+y)+2 
(c) y! "Sn ( y) =, 
2(x+y)+1 
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20. (а) y' = 30x —29 


(b) у= xsinx (== + cos x nx) 
X 
(c) y' 2 x*(14-In x) 


21. (а) y' (vu — gx)/u? 
(b y’ =tan t 


22. x(t)=R cos Злі 
y(t) = К sin3zt 


23. (a) It is a straight line in three-dimen- 
sional space (Fig. 5.43) 


Fig. 5.43 


(b) The curve is an ellipse with axes 
of lengths 2a and 2b, respectively. 
b2x2 +a2y2 = a2b2 


S e 


24. (a) ax(t) = —ovo cos ot 
)-2-ovo sin wt or 
= (— 000 cos wt, —wVo sin wt) 


( 
= (—Ro sin ot, Ro cos wt, 1) 
) = (0, Ro, 1) 
(c) v(t) 3-3 (vo, 0, -81) 


( 
(b w( 


е 


Chapter 6 
1. (а) F( )= 5x2 40; C=5, 
3 xd 
F(x)- 5% ds 
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2. (a) 


3. (a) 


(b) 


(c) 


4. (a) 


(b) 


(c) 


5. (a) 
(b) 
(c) 
(d) 


(e) 


(D. 2 


(g) 


3 
(9 & 


(b) x 


(c) 


(d) 


(e) 


(f) 


(b) 


3 (b) 6 (c) 0 
Ov 0 
E 2d 6, F-|-6|-6 
2 z2 
F =|-2]=2 
2 2 2 4 
F E 2x] JE 2x] =4 
2 o L2 2 
d /x—1 2 
dx \x+1 (x 4-1)? 
d T. y 
ar] g sin (8x 2) 


= 1 — cos {2(4x – 1)} 


= 1 — cos? (4х — 
= sin? (4x 2 1) + 
= 2 sin? (Ax — 1) 


1)+sin? (4x — 1) 
sin? (4x — 1) 


1-х2 


4 х = 
dx \1+x2 


In(x—a)+C 
tanx+C 


(14 x2)? 


a In (x+y x2 a2) +С 


F(a апа cos а) - C 


Z (2Inx-1)+ 
2 


sin x 4- 2x cos 


(3 Inx—1)+ 


=( 


1. п-1 23 
—sin x cos" 1 х fo 2 хах 
п n. 


xnl 


C 
x-2sinx+C 


C 


x?42a?)—2axcosh х -С 
а а 


(n+1) 


ша cos(z x) 4- C 
л 


е3х-6 +С 


(1-1) In x 


11-С 


1 
(c) 5 In|2x +a|+C 


1 6 
(d) 64 a +) +C 
8. (а) In y|sin 2x|+ C 
(b) Inja+x?|+C 
1 40 
(c) то nlx +21/+C 


10. 


11. 


1 

шин cosh u is 
1.5 
(а) 5 sin 


2 13.5 13/2 
() (3x9 - 192 +С 
(с) уа-х2-С 


(d) Эс (х2)-С 


(a) ш|е"--1/-С 


(b) sin (x 


1 
(c) sin x — = sin? x+C 


(d) In 
(e) In 
(f) In 


(a) In 


(b) In 


(c) In 


(d) In 


(e) 


(x — 1)5/3 
x3/2(x +2)1/6 


(х= 1)!/2(x – 39/2 


+C 


х2-2 
х2-1 


(f) In 


(g) In 


1 х-1 
rin 
3(x 4-2) x+2 


x?—- x41 ын 
(Ээ) 
(х1) 
(х2 +2х + 5)1/2 


—2tan^! E 


Answers 


1 
x +2 sint х+5 sin? x +С 


2)*€ cos x 4- C 


Answers 


Note: 


12. 


13. 


14. 


15. 


16. 


17: 


18. 


х2 +15 


(x — 1)(х2+2х+1) 
2 x+5 


х-1 х2-2х-5 
(a) -325 (b)n2 
(c) 1.416 (d) 75 


255 27 
(а) A. (b) 4.5 (c) те 


1 1 
ок 
Ge Oz De 
Q! me 


О = (2(xo +2 — xo) + 6(Yo — Yo) 
-1(20-20) Nm=4Nm 


P 
0-1 F-dr= 
JO 
5 25, 
[ xdxNm= — Nm 
Jo 2 


Force and path element are perpendic- 
ular. Thus the scalar product F -dr 
vanishes. The line integral is zero. 


The path element: 
= 2 
dr (t) = (-v2sinr, —2sin2t, =) dt 
л 
2t 
Е (1) = (0-2, 222 
л 


U = [°F arte 


7/2 (At 
si ( sin2t 4 2 enar) dt 
JO л л 


8121 tcos2t 
- С 
4 2 i 


4 |sin2t  fcos2t л/2 
4 2 


Since ii t sin2tdt = 


U 


0 


Chapter 7 


1. 720 square units 


2. 9л square units 


3. 2.14 square units 
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. 0.693 square units 
. 100 square units 
2.97 square units 


. 19.24 square units 


on o0 tn R 


(a) 8 square units 
(b) 10.67 square units 
(c) 36 square units 


9. (1) 98.12 units 
(2) 9.42 units 
(3) 4.064 units 


10. 0.1109 square units 


11. (a) 20.47 square units 
(b) 8.34 cubic units 


12. (a) 67.02 square units 
(b) 49.35 cubic units 


13. 314.16 cubic units 


Vip dH ya an 
3л 3л 
15. 236.81 mm 
16. 5/6 from the vertex 
17. 46.875 mm 
18. (a) MR? 
R? 12 
IM (4 
2 
o M (e 18) 
2 6 


19. (a) 32.98kg/m? 
(b) 131.9kg/m? 
20. I — 1.52kg/m? 


21. I = 76.04 x 106 mm^, k = 74.36mm 
Note that the centroid is 77.27 mm from 
the bottom. 


22. 230kN, 6.33m below the surface. 
23. 5.23 х 10? N, 4m 


Chapter 8 
1. (a) f(x) 2 V1—-x 
no 1x? 3x? 


566 


These are two ways of arriving at 
a solution: 

(i) with the help of (8.2); 

(i1) by using the binomial series 


(w= 2; a=1). Е 
(b) f (t) =sin (ot + л) =—-ot 4 2 
ot (0717 
51 Ї 7| Ew 
(c) f (x) = In[(1-- x)5] 
2 24 : 3 2 44 (gute 
5x 2* | 3* 13 | 
Derivatives Values 
d 
f'G)-ac[in(Le3)5] 9-5 
2,253 
(dx 
f'()--5sü-cx)?  f"(9--5 


f"(x)-5x2(1-x)? (0) =5x2 
f A(x) --5х2 


х3(1--х):5 
2 4 6 
(d) f(x)=cosx=1 3i ЕЗ а - 
= = 1 3, 2 5 
(e) f (x) =tan x Vina i 
17 - 
13457 + 
Derivatives Values 
f (x) «tan х 7 (0)--0 
TREE: 1) — 
feo 55 f'=1 
f" | 2 sin x 1 -2ff' f"0)=0 
puc cos X cos2x | es 
FEOF AS) f" 0) =2 


fo (x)=2(f f "-- f! f"42 f' f") 
=2(f f" 3 f! f") 
790д-2(7 f Aff" +f") F0)=16 


790) =0 


cosh x = 14 Ї Eee 


© f(x) 


Answers 


2. (a) The formula (8.7a) cannot be applied 
because every other coefficient van- 
ishes (an = 0 if n is even). Instead we 
use the formula (8.7b): 

1 


jim 
* n 
Jim. Vlan| 


1 
Since ар = ЯГ if n is odd we find 


R= lim уп! = ә 


П оо 
3” 1 
© |) гез 
Un] 3 3 
1 
Therefore R — lim en 2 
nl ду-1 3 


3. (а) Py(x)=x 


Po (x) =0 
1 
P3(x) mici is 


For details of the solution see 1(e). 


tan x 
Fig. 8.8 
x 4 1 
b 
OP) 2-5 4-5 1-х/4 


(geometric series) 
Py (х) = ly Р(х) = bk 1 x? 
1 =A” 2 my! 16 ? 
1 1 1 
P3(x) x+—x? x3 
4 16 64 
P3 is a parabola of the third degree 


4 7 
ith int of inflecti о ре E 
with a point of inflection a ( 3' 2) 
. (a) y=sin x 
(x—z)4 Here 
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5 
7. An intersection at (1. 3) 


2 3 
Xx ox 
=e-l1lex4 3 
Si(x) =e Nhu a 
x3 
Љ(х) = 2 sin х ~ 2x — z 
2 3 3 
р Б 
Ї H =2 
ШОВХ dE 
é 3 2: = 
Fig. 8.9 х? +x“ —2x =0 
xı = 0, у= 0 
5 
Derivatives Values x2=1, ХЭР- 3 
у! =cos х у(л)--1 
y" ——sin x y"(x)-20 
y" = —cos x y"(x)-1 f(x) 
уб) = sin x y9(x)-0 Ax) =e 1 
уб) =cos х у) (л) = –1 


(b) y=cos х= 14 lox x)? 


1 2! 
4 
4! Р(х) = 2 sin x 
5. f(x)=Inx 
-(х-1) x-1? | (x-1)° р... 
2 3 Fig. 8.10 
Derivatives Values 
8. v42 + 6.4807 
(х) = хт! : f'(1)21 Solution 
(х) = (-1)x^ f'(1)2-1 6 
f(x) =2x-3 f"(1)-2 V36+6= 36(1 | x) 
1 
=6х4/1+ = 
6 
4 4 12 36 
= = H 2 
ubi xe EN MI MU 1 l 
( ap. 108, ap | We set x = 6 and apply the expansion 
' 625 l into a binomial series for 
ae aie ЭР 5x4 
T 7772 8 16 128 | 
Derivatives Values (c.f. exercise 1(a)) 
4x3 12 Hence 
TEL OE FOIS 6x4 is bue 1-1» xd 
72 72 Lc E 36 
PO теа YY)" as MM NM ES 
— 3x 
Ї х х Ї ... 
648 648 ) 
f"(x)- T an f"2)- s 16 216 128 1296 
— 3x 
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= 6(14- 0.08333 — 0.00347 


+0.00029 — 0.00003) 
zx 6.4807 
Note that an alternative approach is to 


/ 1 
use /72—7 27 1-5. 


x2 
9. (a) In (1 +x) x — = 
1 x 3 

b ду ь-х2 
PATE 2 8 


10. (a) е025 ~ 1 +0.25 = 1.25 


2 
1 1/1 
(b) n125 1-5 (1) = 7 0219 


EE 1 1 
(c) У125::14-2Х4 


dx х2 x 
11. Ї jis 
(a) hrs Drag gc 
=In(1+x)+C 
Solution: 


d. 
х Ја x4x?—x? 4 x^. )dx 
5 х 


1+х 


Solution: 


Din Xe 6 
Гоха f e : LIE 2 
: : 2! 4! 6! 


0.58 
12. (a) [ У1--х2 dx z 0.6111 
JO 


Solution 
Expand the integrand by means of 
the binomial series. 


Put /1+x2=(14+x2)!/2 and obtain 


1 1 
утзадах-Ї(1 | 3*. gx" 


1 6 5 8 
Vae? 128% 
5 8 
2 414 
128* + ) 


(c) 


Answers 


3 5 7 


x x x X 
v 14:12 dt 
à Tru 40'112 


x 
EL a «1 
st lal<t) 


Substituting the limits x = 0.58 and 
x = 0, we find the value 0.6111. 


X sin t 3 5 
e f sin din x | х 
o t 3x3! 5x5! 
x? ih 
7x7! 
Solution: 
The integrand can be represented by 
the series 
sint А 
—— = — sinf 
t 
1 fp og 
t 31517!) 
12 14 16 
1 Ї I 
3! 5! 7! 
* sint x? x5 
[ dt | 
o t 3x3! 5x5! 
x? 2. 
TXT 
1 ee 1xx? 
13. x -—x4 
(a) sin 33 


У1-х2 
(1х3х5)х7 (1х3х5х7)х? 


2х4х6х7 2х4х6х8 х9 
+- (|x| < 1) 

UN NC E NE 

2 "6 40 112 1152 


T8 1.3167 +- 
Comparing the approximation by the 
first five terms with the numer- 


value 1.5707 ..., we find that the ap- 
proximation is wrong by more than 
16%! Although the sequence con- 
verges, it does so very slowly indeed. 
This series has no significance for nu- 
merical purposes. 


Chapter 9 


1. (а) jv3 (b) 12j 
V5 


M d) 10j 
2) (4) 10) 


Answers 


2.(a) 1 (b -j 
(с) j (d) j 

3. (а) 6jv3 

(b (203-22 + 4/0.6)ј 
(6). -=3 (d) jvab 
(e) 10j (0 -j 
(g 4 h) j 

d) -2V3 gj) 0 


(k) j(a—b) fora>b; 
j(b—a)forb>a 


6i 
(D + A 


a 
4. (a) 7 (b) 15 


1 
5.6) z*-5-2i (9 "= 5 «i3 


3 
6 (8) 21=-2+3) (0) zi =-F 4) 


3 
--2-3)| IRSE 
22 ) 22 4 J 
; 3 
7. (a) 10--3) (b) 5 
8. (a) w—-1-5j (b) 4-1-13| 


9. (а) W=2 (b w-—23-2j 


10. (@) 4424-3109 -5v3-2j 
(c) —0.4+0.7j 
(5 2j 


(f) 8+4jV3 
11. (a) (2x +3jy)(2x — 3jy) 
(b) (Va +јуБ) (уа —jv b) 


Fig. 9.21 


13; 


14. 


15: 


16. 


17. 


18. 


19. 
20. 


21. 


22. 


23. 
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211-2-| 212-1-1 7z3=-3 
24 2 j 25 = 2) 26 2 j 


3 3 
(a) z = V3 (cos E +jsin SE ) 
5 5 
(b) АСЕ sin) 


5 5. 
(a) z-3-35jV3 
(b 2--2үу2-2)у2 


(а) 2 = 6(cos 60? +jsin 60°) 
-3-3)у3 
(b) z =5(cos 120° +jsin 120°) 


5S 
mellm rus, 
205159 
(а) 2 = cos 45? +jsin 45° 
1 1 
=-vV2+-jv2 
5 V2 t 3iV2 


(b) z =cos 330? +jsin 330° 

1 1 

m. 

Qus is 
Multiplication by —j — j? means an anti- 
clockwise rotation of 270°. 
Division by —j — j? means a clockwise ro- 
tation of 270°. 


(a) у2(-4--4) (Ы) -1 
(a) (cos 50° —j sin 50°)4 
[cos(—50°) +j sin (—50°)]* 
cos(—200°) +j sin (—200°) 
= cos 200? —j sin 200° 
(b) (cos œ +j sina)” 
—cos (na) +j sin (næ) 


(а) 2 =243) 
22 = -2-3j 

(b) zı =0.966 + 0.259} 
29 = —0.259 + 0.966j 
23 = —0.966 — 0.259j 
z4 = 0.259 — 0.966j 


(a) ef /2 = cos 5 +j sin 5d 
1 1. 

(b) 2131/9 

(а) cosy — 1, a=0 
sina = 0 

(b cosa —-1, a= 
sina = 0 

(с) cos а = 0, 


sina = — 1 
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1 л 
(4) cos 0 = 23, 06 
EE! 
mno 
24. (а) r =e? (5) r =e? 
1 
a=2 санж 
25. (а) ш = e? = — e 
("= уе, а — zx) 
(b) ш =—vVe> 
(r 2 Ve), æ = л) 
1 
(c) ш = —j 
е 


— Rd 
e! 2 
3 


(d) w =e? (cos 1—j sin 1) 
e e? x 0.54 — e? x 0.841j 


26. (a) (i) Re[w(t)] =e! cos 2л1 


(ii) Period = 1 


(iii) Amplitude = e~?x1= 70.135 
е 
; 2t 3 
(b) (i) Re[w(t)] = e^' cos Es 


3 
21 
= =F 
=e Cos ( ) 


(ii) Period = 37 

(iii) Amplitude — e^ cos 3 
x~ e^(—0.99) 
x —54.0 


3 


a 3 , 
л apta cic o us 
27. (a) е7 ——1 (b) Y =- 2) 


28. (a) zi" /z2 =4e 37 = —4 


1. j 
b -ол/2 
0-3 3 
29. (а) z? 23207 — —32 
(b) 23 Es 1 3л/4 


30. (a) 21/5 22 e27 =2 
(6) 21/4 = 2 


31. (а) z=3e7 (b z= 2 олз 


3c Zz =7.0711 79 
or Z = 7.0711 e 49 


(b) z = 19.8494 e 9.71413) 
or Z = 19.8494 с-140755/ 


Answers 
33. (a) z 2 1.8134 + 1.7209] 
(b) z = 0.4384 + 0.8988j 
-1 1 
34. (a) х= —, y = —— 
(a) Ja y 5 
(b) 2 = (cos 135° +j sin 135°) 
(= ei377/4) 
35. (a) -v3 +j (b) 2 65/6 


Chapter 10 


1. The linear first- and second-order DEs with 
constant coefficients are (b), (e) and (f). 


2. (a) non-homogeneous, second order 
(b) homogeneous, second order 
(c) homogeneous, first order 
(d) non-homogeneous, second order 
(e) homogeneous, second order 


3. (а) y = Се? + Coe* 

Auxiliary equation: 2r?—12r+10=0 
Roots: r =5, 72-1 

(b) y —el^* (Су +Сәх) 
Auxiliary equation: 4y? — 12r --9—0 
Roots: rj = гә = 1.5 

(c) Complex solution: 
y-e"*[(C, + C3) cos 2x 

+j(C; — C2) sin 2x] 

Auxiliary equation: r?--2r4-5—0 
Roots: r4 1--2), r2 1-2) 
Real solution: y =e *(A cos 2x + 
B sin 2x) 

(d) Complex solution: 
y = e025* [(C1 + C5) cos 0.75x 

+j(Cy — Сә) sin 0.75x] 

Real solution: 
y —e025* (A cos 0.75x -- B sin0.75x) 

(е) у-Сүе2 + C5 e х 

(f) Complex solution: 
y = e0?* (C' cos 0.Ax -jC" sin 0.4x) 
Real solution: 
y —e0?* (A cos 0.4х + B sin 0.4x) 


4. (a) y(x) 2C e-**. (b) у(х) 2 C e?9* 
(c) у(х) 2 € e? 


t? 
5. (a) S(t) ges 
(b) x(t) = cos ot + С + C5 


1 
(D ур=х- 5 


6. (а) yp 22x +1 2 


Answers 


10. 


11. 


12. 
13. 


14. 


15. 


(а) у= Сех +C e73¥/7 —2 


10 
(b у= Сех + C5 e* +х4 


9 


1 
(с) y = С e) +C e* — qx + 


1 13 


(d) y=e *(A cos 2x + B sin 2x) 


gel dios 2x +4 sin 2x) 


17 
3 18 
5x | X 2, 
у(х)=С е t Co eX + gx t oz 
43 
"qas 
(a) y =3 e74 (b) у=е72! е2-1х 
(а) С-0, у(х) =0 
(b С--2, у(х) 2-2e?* 
(0) С =e?, у(х) =e? е2х 
СС, 00) ево 


у(х) = С cos 2x + Сә sin 2x 
(а) у(х) =+sin 2x 

(b) у(х) = cos 2x — : sin 2x 
1 
2 
(d у(х) = = cos 2x +a sin 2x 


(b y=x?+Cx 
(с) y = —2 cos? x + С cos x 


© y= (245) 


2 x 
(а) u=y 2, w-2xu - 2x, 
1 
АЙС е? 
2 
b u-2y и =1, 
х-1 1 
У N41 x In(x +1)2+C 
(с) u=y?, x?w' 4 3xu = 3, 
2420 
Ус \5 753 
2 
(d и=у?, и + 2и= –2(х+1), 
1 4 2x3 
3 cx x 
x 2 3 
1 x 
(a) yes In|2 e* +C] 


(b) 


16. (a) 


(b) 


(c) 


(4) 


17. (a) 


(b) 


18. (a) 


(b) 
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y? «In|C (x +1)?|—2x 
B 2 
— (In|x24- C 
y1-7-x4C 


9 (2y 23- tdv. cya 
дх \ x х2 oy x2)’ 
2 


F=} +4x, y? = Cx- 4х? 


2. (2y — x? sin 2y) =—2xsin2y 


=< ex cos? y), F=y?+x? cos? y, 
y? +х2соѕ2у = С 


д д 2 1 
Эх (2x — 3)=2 3y (3x-+2y), 
3 
3, х?-С 
Е-х7-2ху-3у, y 2297 


Special case 1: и = ех, 

e?* — 3e* cosy = C 

Special case 2: и =e’, 

y+xe 7 =C 

x =e% (Acos t+ B sin t) 

у = 0 |(A—B)cos t+(4+B)sin t] 
x = Acos t + Bsin t 


y 58 3A)cost 5 (A+3B) sing 


(с) х= (4+ Ві) е +(E+Ft)e* 
1 ИЕ 700 
y 208 A- Bt)e 4E VF 
+F te 
Chapter 11 
5 
ear SED 
4s 2 
©) 82-9 e s(s? +4) 
lx 1 -At 
2. (a) 5 sin 5t (b) ze ) 
(c) 20 — cos 31) (d) —6 sinh і 
— Toc» 
(e) t — sin f (f) 29 е 287 


572 
2 
3. (a) y =3 tea) 
(b) ЕО ай 3) 3t 
= sin sin + COS 
os 5 
1 Э, 3 
(c) у= zsint Ї 5 008 f | SE 
4. y 22—3e* +3 е2 
ls We. i-z 
. y= patsy 
2 25775 6 
6. y 225—9e! +5t e! 16e 7/4 


j yee t6. m. 
1 
х-1- зоо? 4et/2) 


8. Q—4x10-^4(1.12sin4471— sin 500ż) 


Chapter 12 
5 х 48 d 3 
y 
-2 -2 4 6 4 
-1 2 5 6 5 
0 6 6 6 6 
1 10 7 6 7 
2 14. 8 6 8 


2. (a) The function z = —x — 2y + 2 repre- 
sents a plane. The intersecting curves 
of the surface are 
(1) with the x — y plane: y=— Iu 


(2) with the x—z plane: z=—x+2 


(3) with the y —z plane: z ——2y +2 


(b) The function z — x24 y? represents 
a hyperboloid of revolution about the 
Z-axis. Intersecting curves with planes 
parallel to the z-axis are parabolas. 
Intersecting curves with planes parallel 
to the x —y plane are circles. 

x? y? 
1 gr 


presents one half of an ellipsoid above 
the x — y plane. The intersecting curves 
with the x —z plane and the y —z plane 
are semi-ellipses. 


3. (a) fx —cos x, 


(c) The function z — 


fy -—sin y 


Answers 


х 


Fig. 12.31 


х 


Fig. 12.32 


х 


Fig. 12.33 


2 -x?y 
(b fy =2xJV1-y*, fy = == 


1— y? 
O fy = 2266-7599, 
fy — —2y e (x7 +y?) 
(d fx =yz+y, fy-xz-x, 


Ј = ху+1 


Answers 


x 
(e) fx =e* In y, fy=—, fe =423 
ВА 


O fr =e%* cos x е) sin (x+y) 
fy = — 00S 089) gin (x+y) 


4. Tangent in x-direction: 2x 
Slope in x-direction at point P: 0 
Tangent in y-direction 2y 
Slope in y-direction at point P: 2 

5. Fx = -2 fyx = 0 

ху = Љу =-2 
2х 2х? 


6. With f — 55 e(/»* and fy =- ee 
y У 


the statement follows. 


7. (a) dz = ————— 
V1-x2- y? 
y dx 
1-x2- y? 
(b) dz 22x dx +2у dy 
-2 
(с) dz = 


(х2+у2+22)2 
x(xdx + ydy + zdz) 


8. (a) 7.6% (b) 5.13% 


9. (a) The contour lines are straight lines. 
x 
=- =+1-C 
y 2 + 


grad f = (-1,-2) 
(b) The contour lines are ellipses. 


2 2 
Xx y 2 
nsa es es 
4 i 9 E 
-1 x y 
таа : 
grad f /1—x2/4— y2 [9 (1-5) 


(c) The contour lines are circles 
centered at the origin. 


—10 
grad f= qao) 

10. (a) The surfaces of constant function 

value are planes. 

ху 

ub Te 
grad f = (1,1,—3) 

(b) The surfaces of constant function 
value are cylinders centered at the 
origin. 

C x? i y? 
grad f = (2x, 2y, 0) 


25 
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(с) The surfaces of constant function 
value are spheres centered at the 
origin. 

С =x24y2422 
grad /-3(52--у2--22)1/2 (x, y, 2) 


11. (a) 
3 

2 
() 324442 


sin 21 — 3 cos 2t 


12. (а) 2x +2ay (b) 1 


© -$ @o 


13. (a) Minimum atx = 1, y=1 
(b) Maximum at x = 10, у = 8 
(с) 0 = 30°, h=0.32m, | = 0.20m 


14. Cable (a) 
f (x,t) 
— 0.5cos СЕ 250) m 


or 
f (x,t) =0.5cos 27 (st - 1379) m 
or 
f (x,t) =0.5c0s—% (12.51 x+$)m 
Cable (b) 
x 
f (x,t) =0.2cos2 (0.87 -5 +4) 
The wave velocities are unequal: 
Ca = 6m/s сь = 3.2m/s 
15. of = —2w(vt — x)e- (vt-x* 
92 f Эг 06:50 
OU —-—2v*e (vt-x) 
+ (2v)? (vt — x)2e-(0t-x* 
PF ых) 
ox? 
+4(vt — x)2e- (0t 
Consequently 
Pf. ntf 
д:2 Әх? 
Chapter 13 
1. (a) ab (b) 5 
3 
(c) 4 (4) 12 
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1 
()4 0 c (6471 -e%7)(y1 — yo) 


(b А2 = х2 +у2, г= В, 
in polar coordinates 


_ 4 a? 
(c) P x 937 


4. (a) V =2h(R>* — Rj”) 
(b) V -inR^h 


I= Ž MR (М = total mass = pV) 


5. l= MR, М- 52 R3p 


Chapter 14 


ГА 


1, xQ «x, у= у, 2 =2-2 


2. The transformations аге 
x-x'-2, у-у-3 
Substitution in the equation gives 
y'—--3x'48 


à (188) 
| d-V3«2 


4. The transformation equations are 


x x! УЗ у! 1 y = x! 1 | у! v3 
2 27 2 2 
Substitution in the equation gives 
Pas 6 3 x! 
“AB Wa 


5. The transformation equations are 


x! = 3cos 30° + 3sin 30? 
= 4.0981 to 4d.p. 

y! = —3sin30? + 3cos 30? 
= 1.0981 to 4d.p. 

z! «3 


Hence r’ = (4.0981, 1.0981, 3) 


Answers 


33 
(a) A+B 18 
-19 
-13 
(b A-B 32 
15 
12 42 
(а) 6А-118 0 
54 —6 
25 -8 28 
b) AB=[27 9 0 
80 29 -4 


27 63 
pus 2 3) 
Hence AB 5 ВА. 


No matrix multiplication is possible in this 
case. 


x'V (х-2у 
y] \5x+7y 


т (1 4 3 
10. (a) A 2G 2 2) 
1 2 
(b (AT)T=[4 -3] =A 
з 0 
11. 3. 
54 0.5 4.5 0.5 —3.5 
12. [0.5 26 52 |+ 20 0 ES 
45 52 9 3.5 32 
13. 8-0-21 6400-6 
АА?! s 16-947 1243-2 
8-6-14 6-2-4 
9-0-9 
18-15-3 
9-10-6 
1 (1300 
= = |0130 |-I 
131 о 0 13 
Similarly, AT! A = I. 
Chapter 15 
l. (a) X1 1, x2 6, X3 5 


(b) The second and third equation are lin- 
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early dependent. Thus the solution con- This reduces to x; + y; = 0. A con- 
tains Z as parameter free to venient solution is 
21- 102 1-79 +652 z 1 
25 > 25 ry = _1 
(c) x1 = 13, x2 = 15, хз 20 For Az = 5, solve 
0.42 – 1.52 0.24 — 1.82 
а = Де 2 x 
SUN 012 °” 0.12 ( Я 211218 
The first and third equations are lin- . 2 
early dependent. іе. – 1х2 +2у2 = 0 
1х2 = 2y2 =0 
л l = t This reduces to x2 — 2y2 = 0. A con- 
4 2 4 venient solution is 
1 1 1 7-8 
2 
2 @ 15-1 5] © x ( ea) L2) 
3° 3 1 
8 4 8 
3. (a) X1 = хә = х3 = 0 
72 2 
b L-——— = — 
(b) x 20777 10 


4. (a -9 (0) 0  (o-322 
(d) —186 (е) 22 


5. (a) r=2 (b) r=3 


6. (a) det A = —104 Z 0; hence unique so- 

lution. Fig. 16.4 

(b) detA — 0; no unique solution exists. 

(c) detA 52 0; unique solution exists. 

(d) detA = 0; first and third equation are 2 
dependent. 

(e) detA — 0; third equation is a linear 
combination of the first two equations. 


. No. The characteristic equation is a real 
polynomial equation of degree 2z. We know 
from algebra that if z is a complex root 
then z* is a root as well, i.e. this character- 
istic equation has either two complex roots 
or two real roots. 


Chapter 16 3. The characteristic equation is 
(3-4)1-4)-4-42-43-7-0 


There are no real roots, since 


1. (a) The characteristic equation is 


(517 "dd Ато = 2x V4 — 7 are complex numbers. 
= (4—-A)(3-24)-2 4. (a) The characteristic equation is 
= 42 -74+10=0 -1-A -1 1 
A, = 2, А = 5 det | —4 2-1 4 " 
ES 1 5- 
For A = 2, solve 13.612.341 940 
( : i ) ( ыг ) -0, If À is an integral root, then it must 41- 
: yı vide into 24, the last coefficient. 
ie. 2х] +2у = 0 Ay = 2,42 = –2,Аз = 6. 


1х1 +1y; = 0 
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5; 


(b) For Ay = 2; solve 
3x1 — yı + 2, = 0 
— 4x, + 421 =0 
xı + yı + 321 =0 


This reduces to Хү = 2] and yy = 
—2x 1.x 1 = 1 gives the particular so- 
lution: 


ry = -2 
1 
For A> = —2, solve 


X2— У + 22 = 0 
4x2 4y2 422 0 
X2 y2 722 0 


This reduces to x2 — уз + Z2 = 0 and 
X2 — y2 — Z2 = 0. Hence x5 = y» and 


22 = 0. 
Choosing x2 = 1 gives the particular 
solution: 
1 
гэ = |1 
0 


For Аз = 6, solve 


7х3 Уз 2з = 0 
4х3 4y3 423 = 0 
хз + Уз- 23 = 0 
This reduces to x4 + уз — 23 = 0 and 
Хз — Уз +23 = 0. 
Hence x3 = 0 and уз = 23. Choosing 
Уз = 1 gives the particular solution: 


0 
үз = |1 
1 


A, = 1, 42-1. 
For the first eigenvalue an eigenvector сап 
be quickly found: 


(0 


But for A> we should like to have another 
eigenvector which is truly different (i.e. not 
merely a multiple of у). Unfortunately, no 
such vector exists. 


Answers 


Chapter 17 


1. (a) A—4(0,0,1 
(b) A = 4(0,1,0) 
(с) A = 4(1,0,0) 
—A would be a proper solution in each 
case also. 


!1) 


4 
А 


5. Е = (2,2,4) is a homogeneous vector field. 
ГЕ-ал = 0 for (a) and (b) 

6. F (x,y,z) is a spherical symmetric field 
for (a) and (b). Rule 17.8 tells us 
ГЕ-аА = An R? f (R) for R=2 


3R 3 
(a) F(R)= RR 
2 
fF -dA=4n R полк 
R 
(b F(R) = ——— 
(8) ү1--82 
R Ax R? 
F.dA-AxR? 
f ү1-82 V14+R?2 


7. The differential surface element vector is 


dA = (dydz,0,0). 


3 2 
Јал = | гауаг = f zaz | dy 
Jo 70 
9 


n 229 
8. (a) div F =3 
Each point in space is a source. 
(b) div F = 22 
In the plane 2 = 0 no point is a source 
or a sink. All points below are a sink, 
all points above are a source. 


9. (a) curl = (0,0, 1) 
This vector field has curl. 
(b) curl = (0,0,0) 
This vector field is curl-free. 


Answers 


10. We know that curl F = (0,0, 0). Therefore 


fF -ds=0 
JC 


11. Because of curlF = (0,0,0) the line in- 
tegral is independent of the path. Therefore 
we chose the path of integration 
along the Z-axis, 2 = 0 to Z = 3 


f. Feds = [(.y,2)-(0,0,42) 


3 9 
2 dz = 2 
| вг 2 


Chapter 18 


1. Since f (x) is an even function, the coeffi- 
cients bp vanish. 


ao - + [^ f(x) dx =1 


1 x2 
an = = J -n/2cosnx dx 
1 л/2 
= — са 
zn z/2 
1 ( AT . ( ty) 
sin sin 
zn 2 2 
2 ил 
= — sin — 
zn 2 
For и even Алд is zero. Thus the Fourier 
series is 
1 281"! 
Ї 2n-1 
fx) 2*x 2. ani cos(2n—1)x 


2. The function is odd. Thus all coefficients 
Qn аге zero. 


1 JU 
bn == f f (x)sinnx dx 
7 /-л 


10 1 /7 
==] sinnxdx—— | sinnxdx 
JT jJ—x TJO 


[cosn7 — 1] 


-1 1 
1 t 
-| cos(—nz)| TH 


-2 6D" 
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For n even the coefficients 5, are zero. 
Thus we obtain 


4 2 1 
f(x) sin(2n + 1)x 
л —692n-1 
3 1 0 л 
: a - z(-f sinx | sinx) =— 
m -л JO 
1 0 
аһ = — (-/ sinx cosnx dx 
л /-л 
JU 
+f sinx cosnx ax) 
ЈО 
1 Y 1 | [соѕ(и + 1)х 
л 2 (n +1) 
. cos(n — 1) х] 9 
(1-1 d-r 
cos(n +1)x 
(n+1) 
cos(n — 1)x]” 
(7-1) Jo 
4 
—— —— —À-., neen 
an = 4 л(п+1)(п – 1) 
0, n odd 
1 0 
b, = — (/ —sinx sinnx dx 
л 1/-л 


Ps 
«f sinx sinnx J =0 
JO 


The Fourier series of the rectified wave- 
form 


4. A similar function, with the period 27r, has 
been treated in the example on p. 495. 


1-22 1)! 2n-1 
(-1) cos (2% 15 


7009-5552 3n-1 2 


1 
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Chapter 19 The mean value is 
1 2 3 4 
X —2x + 3x + 4x + 5x 
1. Let A, B, C, D, E be the five dishes. The 38 36 38 36 
sample space consists of the following sets 6x + 7X + 8x Эх 
of possible pairs: 3 2 36 2 36 36 
(AB, АС, AD, AE, BC, BD, BE, CD, CE, F10x + 11x + 12x 
DE} 36 36 36 
1 Thus x ST 
цэг 2 ша! x /(хуйх- | х= dx 
3, p=} EE : _4 
: : uid 3 
4. Рд = = 
M 3. P= ei (0.6)8 (0.4)? 
5. P(blue) = 0.8, P(green) = 0.2 8 
Compound probability = 0.16 = 45 x 0.016 x 0.16 
y 1 1l cS 
6 P= = x =- = — : ERN 
36 9 324 4. The random variable which is distributed 
according to 
7. р 1 | 2 | 355 T 8 : 
36 36 36 6 f(x)= е- Х-и)/ ор /2 
8. №= 5! = 120 су2л 
15 15! has the mean value u. Hence it follows that 
9, N= ( 3 ) = = 455 Е - 
31(15 — 3)! (а) X=2, (b Х--4, 
Chapter 20 Chapter 21 
1. The probability distribution for the random ШАН Эн SE S OU enm 
variable ‘sum of the number of spots’ was 2. (a) 
given in Sect. 20.1.1. 
Di Pi = б (0; — 9? 
Random Probability Random Probability (g/ cm? ) (g/ cm?) (g/ cm?)? 
variable variable 
3.6 0.4 0.16 
2 1 8 5 3.3 0.1 0.01 
36 36 32 0 0 
3 2 9 4 3.0 —0.2 0.04 
36 36 3:2 0 0 
3 3 3.1 —0.1 0.01 
4 36 10 36 3.0 -0.2 0.04 
4 2 3.1 —0.1 0.01 
5 36 11 36 3.3 0.1 0.01 
5 1 Sum 28.8 0 0.28 
6 36 12 36 
7 6: 0.28 
36 с? = т 0.035(g/cm?)? 


с = 0.19g/cm? 
p= 3.2g/cm? 


Answers 


(b) Mean value: 


Yu; 12.80 

10 m/s — 1.28m/s 
Variance: 

X(u-v) 0.011 
o? Nol 9 (m/s)? 


— 0.00122(m/s)? 


Standard deviation: 
с = 0.035 m/s 


3. Mean value: 


1 1 
= dx == 
ш n X 2 


Variance: 
82201 ТҮ? 1 
= d 
с 1 (« 2) x 12 


(00 | 0.19g/cm? 
(а) Ом VN 3 


Confidence intervals: 

3.14 g/cm? <p< 3.26g/cm? 
3.08 g/cm? <p< 3.32 g/cm? 
0.035 m/s 
(b) ом = 90355 

Confidence intervals: 
1.27 m/s € v € 1.29 m/s 
1.26 m/s € v € 1.30 m/s 


— 0.06 g/cm? 


— 0.01 m/s 


5. 1696 


6. (a) A-Xy—120x90cm?—10 800cm? 
Calculation of Oy, using Gaussian 
error propagation law: 


2 2 
Oma? = Ax ах +Ау oy? 


= 90? (0.2)? cm^ 
4-120? (0.1)? cm* 


— 468 cm^ 


л ULP. н 


м 
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OMA = 21.63cm? 


А = (10 8003- 21.63)cm? 


b 
a V= AG (2y = 124.79 cm? 
3 12 
M 1000 5 
P = у = 124,55 8/9" 
— 8.014 g/cm? 


Calculation of oy using Gaussian 
error propagation law: 


ð (MN 1. 1 
212812 V  124.79cm? 
1 
= 0.008 — 
cm? 


9 (MN a (6M -18m 
др Хү др \ rD? лр“ 


ом2 = (0.008)? (0.1)? ( Ё y 


cm? 


+(3.88)2(0.01)2 2 
g 


= 0.001 5041 (—) 


ом = 0.039 
cm 


= (8.014 + 0.039) =. 
cm 


| XEmjS;-nmS$ 65.6-5x4x3 


Xm?-nm? 90—5 x42 
Ime. 0.56 
10 
b = Š -am =3 -0.56 x 4 = 0.76 

m m? S mS 
@ (e) (m (всп) 
2 4 1.6 32 
3 9 27 8.1 
4 16 32 12.8 
5 25 3.5 17.5 
6 36 4.0 24 
20 90 15 65.6 
m=4g S=3cm 
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8. 73?2/8" 


Index 


A B 

Base 69, 74 
Abscissa 42 Base vector 471 
Absolute error 122 Bernoulli DE 306 
Acceleration 99, 134, 158, 273 Bernoulli's equations 306 
Acoustic wave 371 Binomial 
Addition coefficient 71,517 

distribution 527, 529, 533 


formulae 62f. ] 
law 511 expansion 517 


theorem 71 
Bound vectors 9 
Boundary condition 147, 276, 291, 293, 294, 


of vectors 4 
theorems 67,411 
Amplitude 55, 59 


: 296, 372 
Angular velocity 131 
Antisymmetric matrix 423 C 
Aperiodic system 297 
Approximate polynomial 233, 243 Cantilever beam 292 
Approximation 228 Cardioid 202 
first 234 Cartesian coordinate system 42, 387 
second 234 Catenary 79 
third 234 Cauchy 232 
Area Center of mass 208, 210 
bounded by curves 191 Center of pressure 208, 222, 223 
function 150, 151 Centroid 208, 210, 398, 550 
in polar coordinates 195 Chain rule : 104 
Characteristic 


ofa circle 196, 385, 388 
Argand diagram 250, 253, 256 
Argument 41, 252, 256 
Arithmetic mean value 525 


equation 454—456, 459 

polynomial 455, 456 
Circle, equation in parametric form 131 
Circular frequency 57, 369 


Asymptote 46, 119 Circulation 481 f. 

Atmospheric pressure 237 Clearing the fractions 172 
Augmented matrix 433, 434, 436 Co-domain 40 

Auxiliary equation 281, 284 Cofactor 439, 443 

Average velocity 98 Column vector 415, 432, 445, 451 


Axial symmetry 390 Combination 516 ff. 
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Common ratio 86 
Commutative law 27 
Complementary 
area 193,194 
function 277, 278, 285 
Complex conjugate 248, 266 
Complex number 247 ff. 
addition and subtraction 249, 251 
arithmetic form 266 
division 250 
exponential form 254, 266 
graphical representation 250 
multiplication and division 258 
periodicity 266 
polar form 252, 261 
product 249 
raising toa power 263 
roots ofa 263 
summary of operations 266 
transformation of one form to another 260 
Complex root 171, 174 
Component 10,43 
Composition 66Ғ. 
Compound 
event 513, 514 
probability 532 
Confidence interval 545 
Conservative field 184, 487 
Continuity 91 
Continuous quantity 522 
Contour line 350, 351, 357, 360 
Contraflexure 117 
Coordinate 11 
system 42, 386 ff. 
Correlation 554 
coefficient 554 
Cosine 58 
function 58 ff. 
function, exponential form 255, 266 
function, integration of 156 
rue 27 
Cotangent 61 
Cramer’s rule 438, 445 ff. 
Critical damping 297 
Cross product 32 ff., 483 
Curl 480 ff., 483 
Curl-free 480, 484 
Curvature 123, 125 
centre of 123 
radius of 123, 125 
Curve sketching 45f., 118 
Cycloid 137 
areaof 194 


Index 


Cylindrical 
coordinates 389 
symmetry 391 


D 


D’Alembert’s solution 372 
Damping 296 
DE see Dfferential equation 273 
De Moivre's theorem 263 
Definite integral 147, 149, 153, 154, 175, 
191, 379 
Derivative 97—99, 114, 145 
ofaconstant 102 
ofaconstant factor 102, 138 
of acosine function 107 
of a curve given in parametric form 136 
of an exponential function 139 
of a function 97 f., 344 
of a function of a function 104 
of a hyperbolic function 110 
of a hyperbolic trigonometric function 
139 
of an implicit function 127 
of an inverse function 105 f. 
of an inverse hyperbolic function 111 
of an inverse hyperbolic trigonometric 
functions 139 
of an inverse trigonometric function 109, 
138 
of a logarithmic function 139 
of a parametric function 129, 133 
partial 347 
of a position vector 133 
of a power function 101, 138 
of a product of two functions 103 
of a quotient of two functions 104 
of asine function 107 
ofasum 102 Ё. 
of a trigonometric function 107, 138 
Designation 177 
Determinant 423, 424, 438 ff. 
evaluation ofa 440 
expansion ofa 439 
of a square matrix 439 
properties ofa 442 
Diagonal 414, 430 
Diagonal form 444 
Diagonal matrix 421 
Difference 
quotient 96,97 
vector 6 
Differential 99, 100, 351 
calculus 98 


Index 


coefficient 97,99, 100 
total 351 
Differential equation (DE) 273 ff. 
exact 308 
first-order linear 275 
general first-order 306 
general linear first-order 302 
higher-order 317 
homogeneous 275, 277 
homogeneous first-order 279 
homogeneous second-order 279, 281, 282 
linear first-order 277 
linear with constant coefficients 275 
linear with constant coefficients, solution of 
328 
non-homogeneous 277 
non-homogeneous linear 285 
second-order 276 
simultaneous first- and second-order 313 
simultaneous with constant coefficients 
330 
solution by substitution 285 
Differentiation rules 138 
Direction 1 
Dirichlet’s lemma 495 
Dirichlet, PeterG.L. 495 
Discriminant 48 
Distributive law 27 
Divergence 461 
Divergence of a vector field 475 
Domain 340 
definition of 39, 64 
of definition 40 
Dot product 24 
Double integral 383, 398 


E 
Eigenvalue 452 ff. 


Eigenvector 452 ff. 
Electrical field 470 


Elemental 
area 388 
volume 392 
Elementary 
error 545 
event 508,509,514, 532 
End term 92 
Equation 
ofaline 26 


ofasphere 406 
Equations, linear algebraic 429 
Error 

constant 537 


random 


systematic 


537 


Error propagation 


Estimate 


of the arithmetic mean value 
of the standard deviation 


of the variance 


Euler’s 
formula 
number 


Even function 


537 


547 


541 


255, 266, 282 


71, 88 


Event 508 


exclusive 


512 


55,495 


statistically independent 


Exact DE 
Expansion 


of a function 


308 ff. 


of the binomial series 
of the exponential function 


Experiment 
Exponent 


508 
69, 74 


Exponential function 
Extrapolation 39 


Extreme point 


value 1 


19, 121 


362 


71 f., 76, 109, 110 


541 


541 


514 


231 


value, necessary condition 
value, sufficient condition 


F 


Factorial n 


516 


Favoured number 


First moment 


509 


Flow density 462 
Flow, partial 467 


Flux 464 
Fourier 


serie 492 


spectrum 


502 


209, 398 


Fourier’s theorem 491 


Fraction 
improper 
proper 

Frequency 

Function 
circular 


170 
170 


369, 492 


40, 41 
52 


continuous 91 
discontinuous 


explicit 


127 


exponential 71 
fractional rational 


implicit 
inverse 
limits of 


127 
50 
125 


91 


170 


228, 229, 235 


230 


364 f. 
364 f. 
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584 


linear 43 

of a function 66f., 67 

of two variables 337, 367 

periodic 54 

real 41 

trigonometric 52 
Fundamental harmonic 495 
Fundamental theorem 

of algebra 170 

of the differential and integral calculus 

149, 152 


G 


Galtonian board 528 
Gauss 470 
Gauss complex number plane 250 
Gauss-Jordan elimination 431, 433-435, 
443 
Gaussian 
bell-shaped curve 240 
elimination method 430, 431 
normal distribution 545 
Gaussian law 470 


General 
solution 276,279 
term 85, 92 
Geometric 


progression 86, 93 
serie 93, 227 
Geometrical addition 4 
Gradient 483 
of a function 357 
ofa line 94 
Graph 42 
Graph plotting 44 ff. 


H 


Hadamard 232 
Half sine wave 498 
Half-life 73 
Harmonic 503 
analysis 491 
oscillator 294 
oscillator, damped 296 
oscillator, undamped 294 
wave 371,373 
Helix in parametric form 132 
Hertz 57 
Higher derivatives 112 
Homogeneous 
equation 279 
linear equations 436 


Index 


Hyperbolic 

cosine function 79 
cotangent 80 
function 78 f. 
sine function 78 
tangent 79 


I 


Imaginary 
number 247, 248, 266 
part 248 
unitj 266 
Implicit function 360 
Impossible event 509 
Improper integral 179 ff. 
convergent 179 
divergent 179 
Increment 122 
Indefinite integral 145, 159 
Indeterminate Expression 125 
Index 69 
Infinite series 93 
Initial condition 291 
Inner 
function 66, 105 
integral 379 
product 24f. 
Instantaneous velocity 98 
Integral 
calculus 145 ff. 
sign 149 
Integrand 149 
Integrating factor 302, 304, 311 ff. 
Integration 
application of 191 ff. 
by part 161 ff., 186 
by partial fraction 170 ff., 186 
by substitution 164 ff., 186 
constant 276 
Intersecting curve 340, 342, 345, 367 
Interval of convergence 232 
Inverse 50 f., 432 
cosine function 65 
cotangent function 65 
function 50, 64, 67, 322 
hyperbolic cosine 81 
hyperbolic cotangent 82 
hyperbolic function 81 
hyperbolic sine 81 
hyperbolic tangent 82 
matrix 424, 432 
matrix, calculation 433 
sine function 64 


Index 


tangent function 65 
trigonometric function 64 
Irrotational field 480 


L 


L’H6pital’s Rule 125 
Lagrange 235 
Laplace transform 321 ff. 
of a sum of functions 326 
of derivates 326 
of products 324 
of standard functions 322 
table of inverse transforms 333 
table of transforms 332 
Last term 92 
Leading diagonal 414, 421, 423 
Leading term 92 
Lengths ofacurve 198 
in polar coordinates 201 
Lever law 30 
Limit 
ofafunction 89, 90 
of asequence 86 
Limiting value 86 
Line integral 181 ff., 480 
Linear 
algebra 413 
factor 456 
Linear DE 274 
Linear independence 280 
Logarithm 74 ff. 
common 75 
conversion of 76 
natural 75 
Logarithmic 
differentiation 128 
function 77, 109 
Lower limit of integration 149 
Lower sum 149 


M 


Maclaurin’s series 229, 232, 237 
Magnitude 2,4 
Matrix 413 ff. 
addition and subtraction 415 
algebra 413 
column of 414 
element ofa 415 
equation 432 
for successive rotation 419 
multiplication by ascalar 416 
notation 434, 445 


of coefficient 432, 433 
rectangular 414 


row of 414 
Maximum 114, 115, 119, 120 
local 113 


of a function of several variables 361 
Maxwell 372 
Mean value 525 ff., 538 
continuous distribution 542 
continuous random variable 526 
discrete random variable 525 
of a function 178 
theorem 178 
weighted 548 
Mechanical work 25, 180 
Meridian 391 
Minimum 115, 119, 120 
local 113 
of a function of several variables 361 
Minor 439 
Modulus 252,255 


585 


Moment of inertia 208, 213, 215, 395, 398 


Monotonic function 51 
Multiple integral 377 ff., 378 
with constant limit 378, 379 
with variable limit 382, 384 
Multiplication of two matrices 417 


N 


N-order determinant 439 
Nabla operator 483 
Natural frequency 294, 301, 316 
damped 301 
undamped 301 
Negative vector 6 
Neutral axis 220 
Newton-Raphson 
approximation formula 239 
Non-homogeneous equation 278 
Non-trivial solution 436 
Normal distribution 73,529, 530 ff., 545 
Normal vector 135, 136 
Normalisation condition 512,524 


Null 
matrix 422 
sequence 86 
vector 6 

о 


Oblique coordinate system 8 
Odd function 55,495 
Order 


586 


ofaDE 273 

of integration 383 
Ordinate 42 
Orthogonal matrix 422, 424 
Orthogonality 458 
Oscillation 

damped 298 

forced 299 

Outcome space 508 
Outer 

function 66, 105 
integral 379 

product 32f. 
Over-damped system 297 


P 


Paired values 41 
Pappus' first theorem 211 
Pappus’ second theorem 212 
Parallel axis theorem 218, 221 
Parameter 129-132 
Parametric 
form of an equation 129 ff., 181 
function 194 
Parent population 540, 542 
Partial 
derivative 347 
derivative, higher 348 
differential equation 371 
differentiation 344 
Particular 
integral 276, 285 
solution 276, 291, 458 
Path element 182,357 
Period 54, 56 f., 59, 369, 492 
Periodic function 491 
Periodicity 263 
Permutation 515,516,532 
Perpendicular axis theorem 217 
Phase 57 f., 59, 370 
Phase velocity 369 
Point charge 470 
Point of inflexion 113-115, 118, 120 
Polar angle 391 
Polar coordinate 195, 387 ff. 
Polar moment of inertia 216, 398 
Pole 45,118 
Polynomial 170 
as an approximation 237 
Position vector 9, 42, 43, 129, 182 
Postmultiplication 424 
Potential 485, 486 
field 486 


Power 69f.,74 
Power series 227—229 
infinite 227 
interval of Convergence 232 Ё. 
Premultiplication 424, 433 
Primitive function 145, 146, 154, 159 
Principal value 264 
Principle of verification 159 
Probability 508 ff., 520 ff. 
classical definition 509 
density 524, 526 
density function 524, 525 
distribution 519, 520, 524—526 
distribution, continuous 522, 523 
distribution, discrete 519 
statistical definition 510 
Product moment correlation 554 
Product of a matrix and a vector 416 
Product rule 103, 161 
Projection 7, 43, 350 
Pythagoras’ theorem 27, 60 


Q 


Quadrant 42 
Quadratic 47, 363 
pure 47 

Quadratic equation 47f. 
root of 48 

Quantity 

dependent 39 
independent 39 
Quotient rule 104 


R 

Radian 52 
Radioactive decay 293 
Radius 


of convergence 232, 233 
of gyration 218, 398 
vector 391 
Random 
experiment 508, 509, 519, 526, 527 
sample 538,540 
variable 519 ff. 
variable, discrete 519 
Random error 552 
Range 
of definition 40, 116, 120 
of value 40,119 
Rank 
of a determinant 444 
of a matrix 444 
Real 


Index 


Index 


matrix 413 

part 248 
Rectangular waveform 498 
Reduction formula 163 
Regression 

curve 552 

line 549 ff., 554 
Relationship 41 
Relative 

error 122 

frequency 510 
Remainder 233, 235 
Resonance 301 
Resultant 5 
Right-hand rule 32 
Rotation 404, 407, 409 

in three-dimensional space 411 

transformation rules for 409 
Row vector 414 


5 


Saddle point 114, 362 
Salient features of afunction 45 
Sample space 508 
Sampling error 544 
Sawtooth function 496 
Scalar 1 ff. 
product 23 ff., 24, 357, 416, 417 
quantity 4 
Secant 96 
Second 
derivative 112,113 
harmonic 495 
moment of area 213, 220 


Second-order determinant, evaluation of 440 


Separable variable 307 


Separation of variables 279, 307, 308 


Sequence 85 ff., 92 
convergent 87, 87 f., 88 
divergent 87, 88 

Series 92 ff. 


Set of linear algebraic equations 429, 445 


Set of linear equations 

existence of solutions 435 ff., 445 
Shift theorem 323 
Sine 53 

function 53f.,55 


function, exponential form 255, 266 


Singular matrix 423 
Singularity 45 
Skew-symmetric matrix 423 
Skew-symmetry 423 

Slope 98, 114, 346 


ofaline 44,94 
Small tolerance 354 
Space integral 378 
Spatial polar coordinate 391 
Special solution 276 
Sphere, equation ofa 343 
Spherical 
coordinate 391 
symmetry 394 
wave 371 
Square matrix 414, 422, 424, 444 
Standard deviation 531, 539, 540 
of the mean value 544 
Standard integral 159 
Static moment 398 
Stationary wave 373 
Statistical 
mechanics 507 
probability 510 
Steady state 300 
Steiner’s theorem 218 
Stokes’ theorem 484, 485 
Straight line 43, 145 
equation in parametric form 131 
Submatrix 444 
Substitution 165 
of limits of integration 177 
Successive 
elimination of variables 430 
rotation 411 
Sumrule 102 
Summation sign 92 
Superposition formula 63, 295 
Surface area of a solid of revolution 
Surface element 461 
vector 462 
vector differential 471 
Surface inspace 350 
Surface integral 464 
Symmetric matrix 423, 424 


T 


Tangent 61, 95, 113, 114 
plane 361 
уесіог 135, 136 
Taylor’s series 236, 237 
Techniques of integration 186 
Term-by-term integration 228 
Theory of errors 507 


587 


Third-order determinant, evaluation of 441 


Torque 30, 31 
Total 
derivative 358, 360 


588 Index 


differential 351 ff. Variate 519 
differential coefficient 359 f. Variation 518 
Trace 456 of parameters 289 
Transformation of the constant 302 ff. 
equations for a rotation 419 Vector 1 ff.,414 
matrix for rotation 421 addition 4,12 
rule 405, 407 component representation 7 
rule for successive rotation 411 magnitude 15 
Transient phase 300 multiplication by ascalar 14 
Translation 403, 404 product 23, 30 ff., 448 
Transpose 422 ff., 458 product, determinant form 35 
Transposed matrix 422 projection ofa 7 
Triangle law 5 quantities 2 
Triangular function 497 representation ofa 3 
Trigonometric functions 52 subtraction 6,9 
Triple integral 378 Vector derivatives 488 
Trivial solution 436 Vector field 461 
homogenous 461, 462 
U Velocity 98, 134, 158 
Vibration 297 
Uniformly loaded beam 118 Vibration equation 108 
Unique solution 435 Volume 
Unit of a parallelepiped 448 
circle 52 of a solid of revolution 202 
matrix 421, 433 ofasphere 394 
vector 10f.,34 
Upper уу 
limit of integration 149 
ши Wave equation 371,372 
V Wavelength 368 
Weight 548 
Variable Weighted average 548 
dependent 41 Work done by the gas during expansion 193 
independent 41 
of integration 149 Z 
Variance 539 ff. 
explained 554 Zero 


of a continuous distribution 542 ofafunction 46 


